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Abstract 


In this thesis we present our research in the context of multiple field inflation. Current precision measure¬ 
ments of the cosmic microwave background radiation (CMB) provide compelling evidence for the paradigm 
of inflation. Simple single field inflation can fit the data well, but the precise microphysical origin of infla¬ 
tion remains unknown. We are interested in the case that single field inflation is an effective description 
of a more fundamental theory containing multiple scalar fields. Even if these extra degrees of freedom are 
all very heavy compared to the Hubble scale of inflation they still might influence the dynamics of the 
inflaton. This leads to features in the statistical properties of the temperature fluctuations in the CMB. 
We study what the possible effects are and how future data might be able to detect physics beyond single 
field inflation. In this thesis we will first introduce inflation and its current observational status. Then we 
discuss the master’s research, which consists of two projects. The first project is to provide an overview of 
studies of multiple field inflation in the literature. We translate between the different notations and defi¬ 
nitions used in various papers and study the different approximation schemes and their regime of validity. 
The second project is a numerical study of concrete models of multi-field inflation from recent papers in 
the literature. We study if the current and future experiments might be able to detect the presence of the 
additional fields in these models. 
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Chapter 1 

Outline Thesis 


Cosmology is a fascinating area of research. The last decades have provided us with a huge amount of 
experimental data which in combination with theoretical physics rapidly increased our knowledge about 
the intriguing history of our universe. The redshift-distance mappings of galaxies established the fact the 
universe is expanding and supported the idea the universe originated from some extremely hot and dense 
state. This so called Hot Big Bang model could in addition explain the abundances of light elements in 
the universe and gained important evidence from the observation of the Cosmic Microwave Background 
(CMB). In combination with the galaxy surveys it turned out the history of the universe is very well 
described by the ACDM concordance model. On the other hand, these observations gave rise to questions 
concerning the initial homogeneity of the universe and led to the idea of inflation. It was soon realized that 
inflation could also explain the origin of the primordial density perturbations, which are responsible for 
the large scale structure in the universe. These density fluctuations result into temperature fluctuations 
of the CMB, because the photons need to climb out the gravitational wells. It was a great triumph for 
the paradigm of inflation when a more precise measurement of the temperature anisotropies of the CMB 
established the generic predicted properties by inflation. 

Different scenarios for inflation lead to very similar predictions, however, they differ in the more subtle 
details. Because the current data is extremely precise, we can try to use the CMB as a probe for physics 
beyond the simplest models for inflation. The CMB provides a unique window into the early universe 
and therefore in general it is extremely interesting to understand the observational signatures of different 
inflationary models, such that we can recognize hints of new physics or falsify classes of models in the 
current and future data. In this thesis we focus on multi-field inflation and its signatures in the CMB data. 

In chapter 2 we review the basics of the standard model of cosmology. We start with a description of 
the Big Bang model and explain how inflation can solve certain problems within this theory. Then we 
discuss the observational evidence of the paradigm of inflation and outline future tests of inflation which 
can be used to learn more about the physics of the early universe. From chapter 3 on we consider the 
case that single field inflation is an effective description of a more fundamental theory containing multiple 
scalar fields. In chapter 3 we discuss the first research project which is an overview of studies of multiple 
field inflation in the literature. This allows us to introduce multiple field inflation in full detail. We 
translate between the different notations and definitions used in various papers and study the different 
approximation schemes and their regime of validity. In addition we discuss the predictions of observables 
within the approximation schemes. In chapter 4 we discuss the second research project in which we 
perform a numerical study of concrete models of multi-field inflation from recent papers in the literature. 
We study if the current and future experiments might be able to detect the presence of the additional 
fields in these models. Finally, in chapter 5 we summarize and discuss our findings. 
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Chapter 2 


The Standard Model of Cosmology 


The latest decades of precision observations have led to great advances in the understanding of our universe. 
Observations of supernovae [1, 2], maps of large scale structure (LSS) [3] and the precision measurements 
of the temperature anisotropies in the cosmic microwave background radiation (CMB) [4, 5] have resulted 
in the standard model of cosmology, the ACDM model, which is a combination of the Big Bang model 
and Gaussian, adiabatic and almost scale invariant initial conditions, as predicted by the simplest models 
of inflation. In this chapter we review the basics of the standard model of cosmology. We start with a 
description of the Big Bang model and then focus on the role played by inflation, how to realize inflation 
and its current observational evidence. 

2.1 The Big Bang Model 

The Big Bang theory can successfully account for the expansion history of the homogeneous universe, the 
existence of the CMB, the abundance of light elements and the formation of the structure. However it 
cannot explain why the universe was almost perfectly homogeneous at early times and where the initial 
density perturbations come from. In this section 1 we give a description of the Big Bang model. We first 
recap some physics of the homogeneous universe and explain the successes of the Big Bang theory and 
then we address the problems related to initial conditions. 

2.1.1 Evolution of the homogeneous universe 

Galaxy surveys [3] indicate that on the largest scales the distribution of structure in our universe is 
independent of the direction we point our telescopes. Assuming that we are not a special observer and that 
we would have measured the same distribution of large scale structure at any other position in space this 
results in the cosmological principle. The cosmological principle states that the universe is homogeneous 
and isotropic on the largest scales. The Big Bang model relies on the validity of the cosmological principle 
and on the universality of the physical laws, in particular the gravitational laws of general relativity. From 
these symmetries of the universe it follows that geometry of the spacetime of the universe is described by 
the Friedmann-Robertson-Walker (FRW) metric 

+ r 2 (d0 2 + sin 2 6d(j) 2 ) , (2.1) 

which is completely determined by the scale factor a{t) and the curvature k of the spatial hypersurfaces 
of constant time. The spatial slices can either be flat (k = 0), positively curved (k = 1) or negatively 
curved (k = —1). The scale factor represents the relative size of the universe at a given time. Often the 
line element is written in terms of comoving coordinates x l 

ds 2 = —dt 2 + a 2 (t)'-fijdx l dx ^, 

1 This section is largely based on [6, 7] where much more can be found. 


ds 2 = —dt 2 + a 2 {t ) 


1 — nr 2 
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where the spatial metric 7 y contains the information about the curvature of the spatial slices. Comoving 
coordinates can be thought of as the grid points on a coordinate grid which expands with the expansion 
of the universe. This means the comoving distance between two points stays the same all the time. The 
physical coordinates a(t)x l however imply a growth of the physical distance between two points on the grid 
as the universe expands. It turns out that the comoving frame is a very natural frame. Using the geodesic 
equation one can derive that for both massive and massless particles the physical three momentum, with 
respect to the comoving frame, decays with the expansion of the universe as p ~ A . This means that 
in an expanding universe all particles will eventually come to rest with respect to the comoving frame. 
Moreover this means that the wavelength of light is stretched with the expansion of the universe. Photons 
from distant sources will therefore be redshifted, this is quantified by the redshift parameter z 


„n ^ _ A (*o) - A(ii) 
{1) ~ A(tr) 


z + 1 


1 

a(ti) ’ 


( 2 . 2 ) 


where we have defined a (to) = 1. Redshift is being used as a time variable to describe the history of the 
late-time universe. 


The expansion history of the universe or the evolution of the scale factor a(t) can be found if we know the 
matter content of the universe. Homogeneity and isotropy requires the stress energy tensor to take the 
form of a perfect fluid 

T/tu = (p + P)U„U V - Pfj/iu- 

Covariant conservation of energy-momentum yields the continuity equation 


p + 3 H (p + P) — 0, 


(2.3) 


and the Einstein equations yield the Friedmann equations 

(2 

l = -^ + 3p ->- < 2 - 5 > 

where Mp = is the reduced Planck mass and H = ^ the Hubble parameter for which the best 

estimate today is given by Ho = (67.3 ± 1.2)kms _1 Mpc -1 , using the Planck, WMAP low-/ polarization 
and high resolution CMB data [5]. Only two of the three equations are independent, but they can all 
be very useful. From the continuity equation (2.3) we can derive how the energy density changes in an 
expanding universe. If a cosmological fluid satisfies the following equation of state P = cup then the 
continuity equation gives 

a _3(1+a;) . 

In the case of matter such as the visible nuclei and electrons we can neglect the pressure and we find 
Pm ~ a -3 . Besides visible matter which is called baryonic matter it is believed there is another matter 
component in the universe which is called dark matter. In case of a gas of relativistic particles we have 
co = l such that p r ~ a -4 . This agrees with the fact that the energy of these particles is redshifted as the 
universe expands such that it falls of with one more factor of the scale factor. This type of cosmological 
fluid is called radiation and consists of the massless particles photons and gravitons and other relativistic 
particles such as neutrinos (only in the early universe). Interestingly, it has turned out that there is 
another fluid component of the universe which has a negative equation of state co = — 1 which leads to 
a constant energy density p\ ~ 1. This mysterious cosmological fluid is called dark energy. By use of 
the Friedmann equations (2.4) and (2.5) we can determine the evolution of the scale factor in presence of 


H 2 = 


3Mp 
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these three cosmological fluids. Defining a critical density p c = 3MpH q corresponding to a flat universe 
today and the density parameters fl* = y the Friedmann equation becomes 


H 2 





( 2 . 6 ) 


where the matter density parameter consists of one part belonging to baryonic matter and another part of 
dark matter Vt m = If we know the values of the density parameters today then we can read off 

the expansion history of the homogeneous universe. The three cosmological fluids scale differently with 
the scale factor a and therefore there are different epochs in the history of the universe where its content 
was dominated by one fluid. The first Friedmann equation (2.4) and the continuity equation (2.3) can be 
combined to find the time evolution of the scale factor for a single fluid 

aH = a ~ a~^ 1+3u) -» a ~ t 2/(3+3aj) . (2.7) 

First we had an epoch of radiation domination a ~ t 1 / 2 , then matter domination a ~ t 2 / 3 and recently we 
entered an epoch of dark energy domination a ~ e Ht . 


2.1.2 The Big Bang theory and observations 

The key idea of the Big Bang model is that the universe is expanding which is in agreement with the 
observed redshift of galaxies. Hubble was the first (1929) to find that galaxies are receding from us at 
a speed proportional to the distance, which is known as the Hubble law. To understand the evolution 
of the universe from an observational point of view one needs to determine both the redshift and the 
comoving distance of the observed objects. This means additional information about distant objects is 
needed. It is believed that there are standard candles which are objects of known absolute luminosity 
(for example Cepheid variables and type IA supernovaes) and standard rulers which are objects of known 
physical size. The relation between the observed flux and the absolute luminosity of a star depends on 
both the coordinate distance and redshift. The more the universe has expanded and the further away 
the source the fainter the observed flux becomes. Similarly the ratio of the physical size of an object 
and its measured angular size yields the coordinate distance at the moment of emission. Combining the 
two types of observations one can find the coordinate distance as function of redshift. Any matter con¬ 
tent of the universe will predict a certain relation between redshift and comoving distance which can be 
derived using equations (2.2) and (2.6). The measurements today are good enough to distinguish be¬ 
tween different cosmological models and they strongly disfavor a flat matter-only universe. The best fit 
from supernovae only (SDSS, SNLS, HST and low-z) is a flat universe with 30% matter [8]. With the 
current precision measurements of the CMB the cosmological parameters have been determined to even 
greater accuracy. These observations favor a flat universe k = 0 containing 5% baryonic matter, 26% 
dark matter, 69% dark energy and a tiny fraction of about 0.01% radiation today [5]. The determination 
of the cosmological parameters using the measurements of the CMB is explained at the end of this chapter. 

If we go back in time the universe is shrinking. This means that the universe was hotter and denser in 
the past and the early universe was a thermal soup of particles. Therefore the Big Bang model can make 
predictions of non-equilibrium processes in the thermal history of the universe using the known laws of 
particle physics. Two big successes of the Big Bang theory are the existence of the cosmic microwave 
background radiation and the prediction of the relative abundances of light elements produced during 
nucleosynthesis. At temperatures above T ~ leV the universe was a plasma of photons and electrons 
and nuclei. The electrons coupled the photons tightly to the baryons through Thomson and Coulomb 
scattering. When the universe cooled down to T ~ 0.26 — 0.33eH neutral hydrogen was formed such that 
there were much fewer electrons for the photons to scatter with. Therefore at T ~ 0.23 — 0.28eF when the 
universe was 380 000 years old the photons decoupled from the matter and they could travel freely. This is 
what we observe as the CMB as first discovered by Penzias and Wilson [9]. The observed spectrum of the 
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CMB is an almost perfect blackbody with a temperature T = 2.72548 + 0.00057A" [10]. Going even further 
back in time the lightest elements could only form when the universe cooled below the relevant binding 
energies at about T ~ 100 keV when the universe was three minutes old. Using the initial conditions of 
the universe and the relevant cross-sections it is possible to calculate the abundances of these elements. 
It turns out that most of the predictions of the Big Bang Nucleosynthesis (BBN) are in reasonable agree¬ 
ment with the current estimates [4, 11], The predictions depend on the baryon density (proton, electron 
and neutron density at that time) and therefore BBN puts constraints on the baryon density. Precision 
observations of the cosmic microwave background yield an independent measurement of the baryon density. 

The universe is obviously not perfectly homogeneous otherwise there would have been no structure in the 
universe like galaxies, galaxy clusters or filaments of galaxies. Even on the largest scales of lOOMpc or 
bigger, homogeneity is just an approximation and there are actually very tiny inhomogeneities as shown 
by deep redshift surveys [3] and the temperature fluctuations in the CMB [5]. The fact that on smaller 
scales the structure becomes non-linear fits perfectly well with our picture of gravitational instability, 
inhomogeneities grow due to the attractive nature of gravity. The predicted distribution of galaxies and 
their evolution within the Big Bang model agree very well with observations [12] and therefore this is 
considered as another major success of the Big Bang theory. 

2.1.3 The initial value problems of the Big Bang theory 

The perfect blackbody spectrum of the CMB indicates that the early universe was in local thermal equilib¬ 
rium and moreover it turns out that the CMB temperature is almost perfectly isotropic with anisotropies 
of order 1CU 5 . Within the Big Bang model this is surprising since as we will see in a moment one would 
expect many causally disconnected regions of the CMB sky so why do they appear to be in thermal 
equilibrium? Moreover, where do the primordial inhomogeneities come from? In this subsection we will 
further elaborate on the initial homogeneity problem by explaining the so-called horizon problem and the 
flatness problem. In the next section we will see how inflation can resolve all the problems concerning 
initial conditions at the same time. 


In the Big Bang theory the universe has a finite age and therefore a special feature of the spacetime of 
the universe is the existence of a causal horizon which represents the maximum distance of objects whose 
signals could have reached us within the lifetime of the universe. Moreover because of the expansion of 
the universe we can also define an event horizon which represents the maximum distance light can travel if 
it is emitted today. In order to understand the horizon problem we will therefore introduce the comoving 
particle horizon and the comoving Hubble radius. To study the propagation of light it is convenient to 
redefine the radial coordinate d\ = dr/y/l — nr 2 such that we can rewrite the line element (2.1) as 

ds 2 = -dt 2 + a 2 {t) [d\ 2 + f{x)dXl 2 ] , 


where the precise form of f(x) does not matter for the moment because we will only consider radial null 
geodesics. The comoving particle horizon Xp is defined as the comoving distance a freely traveling photon 
traverses between time 0 and time t: 


X P 




——din a. 
aH 


( 2 . 8 ) 


In standard Big Bang cosmology the universe has up to now been dominated by radiation and matter, 
therefore a good estimate of the comoving particle horizon is given by Xp ~ a i n the radiation era and 
Xp ~ a 1 / 2 in the matter era. Performing the last integral for a fluid with uj > —1/3 one can derive 


2 1 

Xp ~ 1 + 3 u^H' 
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This means that in all epochs of the standard Big Bang theory the particle horizon coincides more or less 
with the comoving Hubble radius -jjj. Therefore one would expect that scales beyond, say, twice the Hub¬ 
ble radius are not in direct causal contact. The particle horizon at the time of the CMB is much smaller 
than the full CMB sky we observe today. One can compute the number of causally disconnected regions of 
the CMB sky [13] which turns out to be about 3 x 10 3 patches. On the other hand we see the regions in the 
CMB sky are in almost perfect thermal equilibrium. This is very unnatural and called the horizon problem. 

Moreover we know from the CMB measurements that the universe is very close to flat today \k/o 2 Hq \ < 
0.01. Defining the critical density corresponding to a flat universe at any time p c {t) = 3 MpH 2 we can 
parameterize the flatness of the universe by = Ptot ~ Pc = This means the universe must have 

been very flat in the far past. If there had been a tiny deviation from flatness initially the universe 
would have either collapsed or become diluted too quickly. This second part of the very unnatural initial 
homogeneous conditions constitutes the flatness problem. 

2.2 Inflation 

We have seen in the previous section that the Big Bang theory can successfully account for many obser¬ 
vations in our universe. However, within the Big Bang theory it cannot be explained why the universe 
was so homogeneous at the time of recombination and there is no explanation of where the primordial 
inhomogeneities come from. In this section we explain how the paradigm of inflation can solve both prob¬ 
lems at the same time 2 . Furthermore, we explain the mechanism of inflation in its simplest realization 
and see how its generic predictions are established by current observations. As a historical note, an early 
stage of inflation was proposed by Starobinsky [17] and Guth [18]. Guth explained that it could solve the 
horizon and flatness problem, however the inflationary models of Guth were problematic in the sense that 
no radiation was produced after inflation. Shortly afterwards slow-roll inflation was proposed by Linde 
[19] and independently by Albrecht and Steinhardt [20]. 


2.2.1 Inflation as a solution to the horizon and flatness problems 

Instead of assuming extremely fine-tuned initial conditions we actually want a mechanism which makes 
sure that all the regions in the CMB are causally connected such that causal physics can homogenize and 
flatten the universe. From equation (2.8) we can deduce we need an epoch of decreasing comoving Hubble 
radius in the very early universe 

*&)<»• 

This means the particle horizon can become arbitrarily large depending on how long this epoch lasts. 
Therefore the Hubble radius and the particle horizon are not equivalent anymore. The particle horizon 
depends on the full history of the universe and regions separated by a distance larger than the particle 
horizon could never have communicated. The Hubble radius does not know about the history of the 
universe which means that regions separated by a distance larger than the Hubble radius could have 
communicated in the past. However regions beyond the Hubble radius cannot easily communicate now, 
because the Hubble radius is approximately the distance a photon can travel within one expansion time. 
This can be seen if you compute this distance for a fluid with equation of state to using equation (2.7) 


/ —— dlna 

Ja aH 


2 

1 + 3c jj 


25(1+301) 


1 

aH' 


which is indeed equal to the Hubble radius up to a factor close to 1 for the fluids matter, radiation and 
dark energy. In the case of an decreasing Hubble radius the regions cannot communicate at all because 

2 This section is largely based on [14, 15, 16]. 
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the Hubble radius provides a bound on the distance a photon can travel 


f 00 1 

/ -din a 

Ja aH 


2 1 
— 1 — 3c jj aH 


The Hubble radius is in this case proportional to the event horizon. 


An early epoch of decreasing Hubble sphere is called inflation. Alternatively one can derive equivalent 
definitions for inflation from the Friedmann equations (2.4, 2.5) and equation (2.7). One could define 
inflation as a period of accelerated expansion, 


a > 0 , 


or a period where a fluid with negative pressure dominates 

p + 3 P <0 —> cu < — —, 

O 

or finally a period of slowly varying Hubble parameter 

H _ dlnH 
~ dhT < ' 

This condition says that the relative change of the Hubble parameter during one expansion time is small. 
We defined N as the number of expansion times of the universe (e-folds) such that dN = dlna = Hdt. 
The limit of H constant is called de Sitter inflation, which results in exponential growth of the scale factor 
a ~ e Ht . Since inflation has to end we need to deviate from perfect de Sitter, instead we would like to 
have quasi-de Sitter inflation, and we can introduce the parameter e to parameterize this deviation 


H 




H 2 ’ 


such that for successful inflation we need 0 < e < 1. Since inflation is supposed to solve the horizon 
and flatness problems, inflation should last long enough. Therefore we can introduce another kinematical 
parameter pn which represents the relative change of the e parameter during one e-fold 

d In e e 

VH = ~ 2dN ~ ~2H~e' 

If this parameter is small then it says that e does not vary much during one expansion time and it ensures 
that the period of inflation is elongated. We can compute how much inflation we need in order to solve 
the horizon and flatness problem. The Hubble radius at the beginning of inflation should be at least as 
big as the Hubble radius today. We assume the Hubble parameter is more or less constant during inflation 
Hj ~ 10 l3 GeV (see the next section) and after inflation there is only an epoch of radiation domination. 
Using equation (2.7) this yields the following ratio of the Hubble radius now and at the end of inflation 


aoHo a e y/H 0 2 7 

- r^j - r-~j - ^ r^j y U 

a e H e a 0 yj Hj 

where we used the value for the Hubble radius today Hq ~ 10 -33 eU. Therefore we can deduce the minimal 
ratio of the scale factor at the end of inflation to the scale factor at the beginning of inflation 

Oe _ OeO 0 _ ^-27.5^ > ^ ^q27 

a / ao at ai Hq 

We need therefore N ~ 271n(10) ~ 62 to solve the horizon and flatness problems. 


10 



2.2.2 How to realize inflation 


Although a universe filled with a fluid with negative pressure is hard to imagine, it’s not hard to realize 
inflation theoretically. The simplest models consist of a scalar field (/>, called the inflaton , minimally 
coupled to gravity: 

S = I 

We assume we will have sufficient inflation such that we can safely take the FRW metric for g^ such that 
the inflaton field is homogeneous 0(f,x) = 4>(t). The Friedmann equations yield the following equations 


iR+lg^d^d^-V^) 


(2.9) 


H 2 = 


3M|, 




H = - 


2Mp ’ 


which are consistent with the equation of motion for the scalar field 

<j> + 3 H(j) H—— = 0. 
dcp 

Let us study these equations in more detail. If we would like to have a period of slowly varying Hubble 
parameter we need the potential energy V(4>) to dominate over the kinetic energy \(t> 2 ■ From the equation 
of motion we see that a Hubble parameter acts as a friction term and the field derivative of the potential 
as a force. This means that in order to keep the kinetic energy small the acceleration of the field has to 
be small or equivalently the potential should be flat enough. Because the inflaton rolls slowly down the 
potential, this type of inflation is therefore called slow-roll inflation. We can define slow-roll parameters 
which keep track of whether this situation is satisfied or not 


H 




H 2 ’ 



The slow-roll approximation consists now of taking e, \r/\ <C 1. The second condition implies that the 
friction term dominates in the equation of motion of the scalar field 


3H(j> + 


dV 

d(f> 


0 . 


If this approximation is valid then this implies similar smallness conditions on the gradient and Hessian of 
the potential and one could equivalently define potential slow-roll parameters. If the gradient and Hessian 
of the potential are small compared to the potential energy density itself, then this implies slow-roll. This 
situation is however not generalizable to any inflationary model, in particular multiple field inflation which 
we will study in the main part of this thesis. Therefore we do not discuss them in order to avoid confusion. 
Comparing the slow-roll parameters with the kinematical parameters e and r/n introduced in the previous 
subsection we see that if the slow-roll approximation is valid, then we are in a quasi-de Sitter period, as 
required for successful inflation. During inflation most of the energy density is in the potential energy 
density. When the inflaton reaches the minimum of the potential most of the energy is transferred to 
the kinetic energy of the inflaton such that inflation ends. After this the inflaton should decay in all the 
particles of the Standard Model. This process is called reheating. After thermalization of the particles 
the standard Hot Big Bang era begins. 


2.2.3 An example: quadratic potential 

Because there are many possible models for inflation one might be worried that it cannot be falsified. 
However all the different scenarios for inflation lead to very similar predictions which are confirmed by 
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the data as we will explain later on. On the other hand each particular model may differ in the more 
subtle details of the predictions which allows one to learn more about the underlying physics of inflation 
by the current and future cosmological precision measurements. For now we do not worry about what 
the inflaton represents, but first we try to get more understanding of the predictions of inflation by use 
of a particular example of slow-roll inflation: chaotic inflation. We will first introduce chaotic inflation in 
this subsection and in the next section we will use the same model to study more advanced predictions of 
inflation. Chaotic inflation is a subset of single field models of the form (2.9) with a power-law potential 
which allows for slow-roll inflation. We will take the quadratic potential as concrete example, i.e. 

V{4>) = ^m 2 (j) 2 . 


Assuming we already entered the stage of slow-roll, the field equation and Friedmann equation become 

i _ , tt 2 _ rn 2 (j) 2 


3 H 


and //" 


6 Af 2 


which leads to the following expression for the slow-roll parameters 


e 



and ijh 



( 2 . 10 ) 


Here we derived ijh using the approximate expression for e. We can estimate how many e-folds of inflation 
awaits us if we start at a particular value for the field fa making the assumption we have slow-roll until 
the end of inflation. Then the end of inflation is given by the field value 

e = 1 —y cf) e = s/~2 Mp. 


Now the number of e-folds until the end of inflation can be computed as follows 


N e -N i = 


fae r<Pe TJ 

dln(a) = / Hdt, = / — dcj) ~ — 


f ‘ S-dcfr*- f ' 

JSi 0 Jd>i 


2Mp 


dcj) = 


AM 2 


( 2 . 11 ) 


This expression will be useful when we study the generation of initial density perturbations in the next 
chapter. 


2.2.4 Inflation as the origin of structure 

The Big Bang theory can explain the distribution of galaxies and their evolution assuming a certain 
distribution of initial inhomogeneities which grow due to the attractive nature of gravity. However, the 
origin of the primordial inhomogeneities remains elusive and this introduced another problem concerning 
initial conditions. We have seen so far that inflation can make the universe very homogeneous and 
isotropic. On the other hand tiny quantum fluctuations in the inflaton field are unavoidable and it was 
soon realized that these fluctuations can account for all the structure in the universe. The physical picture 
is that quantum fluctuations in the inflaton field are quickly stretched to large scales by the extreme 
expansion of the universe. These quantum fluctuations are frozen as soon as they reach scales beyond 
the Hubble radius 3 where they become the initial density fluctuations of the universe to seed all structure 
in the universe. Inflation therefore not only solves the horizon and flatness problem, but it also provides 
a mechanism to produce the initial density perturbations, which explains the structure we see around 
us, which is extremely exciting. Starobinsky already noted in his early paper [17] that quantum effects 
during this inflationary epoch would be important and calculated a spectrum of gravitational waves. 
Mukhanov and Chibisov calculated the spectrum of density fluctuations for the first time [21] but it was 
done independently by several groups at a workshop in Cambridge [22, 23, 24, 25]. Observations of the 
CMB confirm these predictions to high accuracy [26, 4, 27] which provides compelling evidence in favor 
of inflation. In this section we will outline the calculation of the lowest order statistics of both the density 
perturbations and the gravitational waves at the end of inflation. 

3 This will be called ‘Hubble radius crossing’ from now on. 
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Cosmological perturbation theory 


We study the behavior of arbitrary quantum fluctuations with respect to the classical spacetime back¬ 
ground and fields. Perturbations in the matter fields are related to perturbations in the metric by the 
perturbed Einstein equations which can be very complicated to keep track of, due to the non-linear nature 
of the equations. Moreover the physical interpretation of these perturbations is a bit obscured by the 
freedom in the choice of coordinates, there is no preferred coordinate system. Perturbations can pop up 
if you pick different coordinates, which means that these are actually not physical. The way to resolve 
this is to introduce gauge-invariant variables, i.e. physical variables which do not depend on the chosen 
coordinate system. The perturbations are assumed to be small and for our purposes it is sufficient to 
expand all equations to linear order in the perturbations and therefore we can use standard cosmological 
perturbation theory 4 . The treatment of the perturbations is a lot easier at linear order for two reasons. 
First of all the Fourier modes of the perturbations evolve independently which can be derived using the 
the spatial translation symmetry of the background. Second because of the spatial rotation symmetry of 
the background one can derive that the modes with different helicity evolve independently at linear order 
as well. We will explain this last statement by applying it directly to single field inflation. The most 
general way to parameterize linear perturbations in this theory is given by 

ds 2 = (1 + 2 <h)dt 2 — 2 a(t)Bidx l dt — a 2 (t)[( 1 — 2’k)<5jj + 2 Eij\dx l dx\ 
c/>(t,x) = 0(0) + <500, x), 


where E^ is symmetric and traceless. The helicity of a perturbation mode is defined by the way the 
amplitude changes under rotations around its Fourier vector k. The amplitude of a mode of helicity m 
gets multiplied by e imd where 9 is the rotation angle. The helicity of different 3-tensor components can be 
most easily derived in the helicity basis e 3 ~ k and e± = (ei ± ie 2 )/\/ 2 , which are defined to transform 
as e±™e± under rotation around k. Now given an arbitrary component of a contravariant 3-tensor T we 
have 

rri / _ i6(n+—n-)rji 

~ e ~jl •••jm ’ 

where n+ and n_ are the number of plus and minus indices in j \,... j m . This means the perturbations 
can be divided into scalar, vector and tensor perturbations, which correspond respectively to modes with 
helicity 0, ±1 and ±2. The scalars <b, 'k and <50 are zero-rank tensors and therefore scalar perturbations. 
The 3-vector Bi can be subdivided in two vector perturbations and one scalar perturbation. The scalar 
perturbation is parallel to k and the vector perturbations are orthogonal to k which means that in real 
space we can decompose Bi as follows 

Bi = diB + BY, with &BY = 0. 


Using the same reasoning we can decompose the symmetric and traceless 3-tensor Eij into scalar, vector 
and tensor perturbations. For the scalar perturbation we find for example 


Ejj — c\k Sij -(- C' 2 kjkj 



hk , 





E, 


where we used the traceless property to eliminate one of the constants. Furthermore we have 

EY = diEj + djEi with d l E t = 0, 

d'El = 0 . 

The decomposition of the perturbations in scalar, vector and tensor types reduces the complexity of 
deriving the perturbation equations. The scalar perturbations are captured by 

ds 2 = —(1 + 2<k)df 2 — 2a(t)diBdx l dt + a 2 (f)[(l — 2\k)<5jj + 2 didj E\dx l dx\ 

0(0 x) = 0(t) + <50(0 x). 

4 This subsubsection is based on [28] 
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Here we absorbed a part of Ef- in 'M^. Note that there are less true degrees of freedom because we still 
need to incorporate the gauge freedom and the Einstein equations. The scalar gauge transformations 

remove two degrees of freedom. The Einstein energy and momentum constraint equations remove in 
addition two more degrees of freedom. This means we are left with one scalar degree of freedom. For 
the actual computation we would like to fix a particular gauge, therefore we should take care that we are 
only considering physical quantities. It turns out to be useful to work with the following gauge invariant 
variable 

- , Oj cj) 

q = ad<j>+ — V, 

the so-called Mukhanov-Sasaki variable which will be the canonically normalized quantization variable. 
Another very useful variable is the curvature perturbation on uniform density hypersurfaces 

TZ = T + ^<50, 

</> 

which has the important property to freeze out on super Hubble scales. It becomes constant at a certain 
time during inflation and it only starts evolving again in an epoch of known physics much later than the 
epoch of inflation. This means that the relevant modes of the curvature perturbation are insensitive to 
the unknown physics after inflation and and therefore this variable is used to compute observables. In 
order to simplify the perturbed Einstein equations one can fix a gauge and rewrite everything in terms 
of the gauge-invariant variables. Some popular gauges are the Newtonian gauge (B = E = 0), the uni¬ 
form density gauge (<9 Tq = E = 0) such that 1Z = \k which explains its name and the comoving gauge 
{dTf = E = 0) such that TZ = T which explains the fact that TZ is also called the comoving curvature 
perturbation. 

Solving the Einstein equations and the equations of motion of the perturbed field leads to the linearized 
equations of motion for the gauge-invariant perturbations. This computation is a bit tedious and since we 
just gave all the ingredients to execute it we just state the result. After performing a Fourier transform 
we get 

q" + (k 2 -=0<z = O, (2.12) 

with z = \/2 a 2 e. Here the prime denotes a derivative with respect to conformal time r defined by 
adr = dt. Before studying this equation in more detail we should not forget about the vector and tensor 
perturbations. These are fortunately much easier to deal with. First of all vector perturbations decay 
with the expansion of the universe, therefore even if they were present we still can neglect them. The 
tensor perturbations are described by 

ds 2 = — dt 2 + a 2 (t)[§ij + 2 hij\dx‘ l dx\ 

with dihij = h\ = 0. It is possible to decompose hij in the eigenmodes of the spatial Laplacian V 2 as 
h t j = x h(t)e+j X (x) such that after performing a Fourier transformation the only equation defining 
the tensor perturbations is 

k 2 

h + 3Hh H— ~h = 0. 
a z 

Note we are studying Fourier modes of the perturbations with comoving wavevector k. Defining the 
variable v = ah we can rewrite this equation as 

v" + [k 2 - ^ v = 0. (2.13) 
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This equation is almost equivalent to (2.12) and if we can solve the equations for the scalar perturbations 
then we immediately have the solution of the tensor perturbations. Therefore we will only focus on the 
scalar perturbation equations and come back to the gravitational waves in the very end. Expressing the 
coefficients of equation (2.12) in terms of the slow-roll parameters we get 

q" + ( k 2 — a 2 H 2 (2 + 2e — 3rj + r/£ — 4 er] + 2e 2 )) q = 0. 

From here we can see that we can divide the evolution of the perturbations roughly in two regimes. 
During inflation the Hubble radius 1 /aH will decrease quasi-exponentially in time. This means that a 
perturbation of sub-Hubble scale k 3> aH will eventually grow to a perturbation of super-Hubble scale 
k <C aH if there is enough inflation. At sub-Hubble scales the coefficient k 2 dominates over a 2 H 2 and 
by neglecting the a 2 H 2 -term. we can solve the equation exactly. Within the slow-roll approximation it is 
possible to find an analytical solution of the full perturbation equation. First of all we can derive 

H' = (l-e)H -> 

T 

such that we get the following equation 




where 



This equation is equivalent to the Bessel equation and the solution is given by a linear combination of the 
Hankel functions of the first and second kind 



(2.14) 


The initial conditions of the perturbations are given by the underlying physical idea that they arise as 
quantum fluctuations. 


Quantizing the perturbations 

To quantize the theory we first need to know the action to quadratic order in perturbations. The simplest 
way to derive this is to go back to the original action, perturb the inflaton field and ignore the spacetime 
fluctuations for the moment. We take 0(t,x) = c/>o(t) + Sc/>(t,x.) such that the relevant part of the action 
(2.9) becomes 

drd 3 x ^a 2 ((cj )' 0 + 5<j )') 2 - (di<f > 0 + did 4 >) 2 ) - a 4 (v((f>o) + V^o )5(j> + ^V^(</>o)<^ 2 + • • • 

The action to first order in the perturbations vanishes because of the background equations of motion. 
The action to second order in the perturbations becomes 

S (2) = j drd 3 x ^a 2 ((Sc/)') 2 - ( diSq i) 2 ) - a^V^^Sc/) 2 . 

Changing variables to q = aScj) this becomes after some manipulations 

S (2) = ^ J drd 3 x [(q') 2 - ( dtq ) 2 + (W + H 2 - a 2 V H ((/) 0 )) q 2 ] . 




Now from this action follow the same equations of motion as for the Mukhanov-Sasaki variable q as given 
in equation (2.12) up to some slow-roll corrections. If you would have picked the spatially flat gauge then 
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q = q which means this is very close to the quadratic action in perturbations. The correct normalization 
follows from here and therefore the full action to quadratic order in perturbations becomes 

S (2) = \J drd 3 x (q ') 2 - (d ig ) 2 + ^q 2 . 

Note that we have canonical kinetic terms, which means that the Mukhanov-Sasaki variable is the one we 
should use for canonical quantization of the perturbations. 

We know how to quantize the canonical coordinate and momentum fields q and n = q'. We promote them 
to operators 

q —>• q, and it —>■ if, 

and impose the standard equal-time commutation relations. 

[g(r,x), 7 r(r,y)] = i5(x - y) (2.15) 

Expanding q in terms of its spatial Fourier modes q^] 

« (r ’ x) = / (^ 7 ^ fik(r)eik ' X ’ 

the equations of motion are given by equation ( 2 . 12 ) which we repeat here while keeping the index k for 
clarity 

Qk + (k 2 ~ ^ Qk = 0 

The solutions of the equations of motion for q k are given by complex mode functions qk (and 7 T k = q k ) 
such that the general solution of q k is determined by a constant complex vector a 

qk = qk a k + qt a -k 

and the solution for 7Tk becomes 

71k = ^k a k + 7T k a -k- 

Note that the reality condition q^(r) = q~k( T ) is manifestly satisfied. After promoting the vector and 
its complex conjugate to the operators and a^, we would like to interpret them as annihilation and 
creation operators respectively, which satisfy the usual commutation relations 

[dk, Oq] = <5(k — q). (2.16) 

These are consistent with the commutation relations (2.15) if and only if the mode functions are normalized 
as follows: 

qt^k ~ KQk = i, (2.17) 

Another constraint on the mode functions comes from imposing initial conditions. We assume that at the 
beginning of inflation the initial state of the universe is the vacuum state | 0 ), defined by dk | 0 ) =0 for all 
scales k of interest, such that there is no initial particle production, i.e. the Bunch-Davies vacuum. This 
statement implies that at the beginning of inflation when k dominates all other scales in the equations of 
motion the universe is in the state of minimal energy and the Hamiltonian does not contain any terms 
proportional to dd and a) a). After some algebra the energy of the ground state is given by 

<0| H |0) = ^ J d 3 k<5 3 (0) Tr k TT* k + k 2 q k q* k - y q k q* k . 
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Therefore at the beginning of inflation tq, which is assumed to be deep inside the sub-Hubble regime where 
we can assume k —, we have in addition to (2.17) the following constraint: 

\TTk\ 2 + k 2 \q k \ 2 is minimized (2-18) 


Writing the sub-Hubble solution to the equations of motion for qi- as 

q k = ae ikT + be~ ikr . 

we can derive from the constraints (2.17, 2.18) that the initial solution is given by 5 


Qk{r) 



(2.19) 


for some random phase A. This completely determines the evolution of q^. Completing now the analysis 
in case of slow-roll inflation as given by (2.14) we find the following early time limit 


lim q[r) 

— kr— 



Therefore we have A = 0 and B = * 2 (^+ 2 ). After the modes have crossed the Hubble radius we 

find therefore the following late time solution 


lim 

— kr —>-0 


q{r) = B 


V2T((5) 

7r 



TQ3) / 2 y- 1 / 2 
V2kir \-kr) 


Transforming to the curvature perturbation 7Z we hnd at late times 


U V2eM p a q 2^~eM p k^ (2-20) 

here we used the slow-roll approximation for which we have /3 ~ § and — ^ ~ aH. It has been proven by 
Weinberg [29] that the curvature perturbation 1Z freezes out at super-Hubble scales. This expression for 
1Z should therefore be evaluated for each mode at Hubble radius crossing k = aH. 


2.2.5 How to characterize the perturbations 

In order to relate the observed temperature anisotropies in the CMB to the predictions of a particular 
theoretical model of inflation we compute the power spectrum of the density fluctuations. The power 
spectrum is the Fourier transform of the real-space two-point correlations and contains all the information 
about the distribution of density perturbations in case they are Gaussian 6 . For every mode k the value 
of the power spectrum represents a variance crFrom the CMB we have learned that the distribution is 
very close to Gaussian, so it is a good first characterization of the perturbations. To learn more about 
the distribution of perturbations however, one should compute higher order correlations in addition. The 
dimensionless powerspectra Vtz for TZ^ are defined as 

(0|7t k (r)7t k /(r) |0) = <K k - k^^-Vnik, r), 

such that 

/ ,/3n 

—V n (k,T)e- ik <*-y\ 

5 Strictly speaking, the Buncli-Davies initial conditions can only be imposed when inflation is eternal, such that one can 
take the limit r —> — 00 . 

( ’More precisely, all information about a Gaussian distribution of density perturbations is contained in the Gaussian density 
matrix, which is determined by the qq, qn and iffr two-point correlations. 
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In the case of slow-roll inflation we plug in equation (2.20) into this definition and we find 


_ _ H 2 
n ~ 8n 2 eMp k=aH ' 

The power spectrum is evaluated at Hubble radius crossing because of the freeze out of the curvature 
perturbation. Since we are in the slow-roll regime the functions H and e are not precisely constant 
but rather slowly varying in time. This means that the power spectrum has a slight scale dependence, 
since different modes cross the Hubble radius at different times and therefore the amplitude of the power 
spectrum changes slightly as well. Before we compute the scale dependence we should not forget about the 
spectrum of gravitational waves. The computation goes in a similar fashion as for density perturbations 
and we won’t repeat the full analysis. The important steps are to solve equation (2.13), to find the 
canonical quantization variable which is M p ^. Finally, after transforming back to h, the power spectrum 
for gravitational waves is given by Vt = 2 Vh because there are two polarizations. This results in 

2 H' 2 

1 ~ 7 T 2 Mf k=aH ' 

Note the difference with the power spectrum of scalar perturbations, there is no factor of e. This means 
that a measurement of the amplitude of the power spectrum of gravitational waves would be a direct 
probe of the Hubble scale of inflation. Furthermore the power spectrum of tensor perturbations is also 
scale dependent because H changes in time. For slow-roll inflation the power spectra are determined by 
the amplitude and its tilt to a very good approximation 

/ k \ Ua ~ l 

V n = A s (k,) l^-j , 

/ u \ nt 

V *= A ^ u) • 

Here A s and At are the amplitudes of the scalar and the tensor power spectrum and A;* denotes a pivot 
scale at which they are evaluated. The parameters n s and nt represent the tilt of the power spectra and 
are called the scalar spectral index and the tensor spectral index respectively. The scalar spectral index 
can be computed as follows 


d In Viz d In Vtz dN 
d In k dN d In k 


and similarly we find the tensor spectral index 


—2e + 2 rjH, 


n t 


—2e. 


Furthermore we can define the tensor-to-scalar ratio as 


A t 

?’ = —- = 16e, 
A„ 


which indicates the relative amount of tensor modes compared to the scalar modes. The observables A s , r, 
n s and nt can be related to experiment. The observable scales cross the Hubble radius 60 — 53 e-folds before 
the end of inflation such that the slow-roll approximation is valid. Slow-roll inflation predicts therefore 
two spectra with a red tilt, i.e. more power on large scales. The tensor power spectrum can only get a blue 
tilt if the weak energy condition is violated, p + P > 0. Moreover, slow-roll inflation predicts a background 
of gravitational waves, although the relative amplitude depends on the precise model. In the next section 
we will discuss the splendid agreement of the inflationary paradigm and the current observations, but first 
we will get some feeling for the values of these observables. 
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An example: quadratic potential 


Let us compute the values of the observables in case of chaotic inflation with a quadratic potential. Since 
the interesting modes cross the Hubble radius between 60 and 53 e-folds before the end of inflation the 
value of the field can be estimated using equation (2.11). This yields 


<^>60 ~ 15.5-Mp and <(>53 ps 14.6Mp, 

plugging this into equation ( 2 . 10 ) the slow-roll parameters take the approximate value 


e = ~t]h 


15 2 


This yields the following estimate for the value of the observables for chaotic inflation 


m 


n s ss 0.96, r « 0.14, rot ~ 0.02 and A s « 53— 
s ’ M| 

The measured amplitude of the power spectrum determines the value rro, and therefore inflation predicts 
the shape of the spectra and their relative amplitudes. 


We can use the measured value of the amplitude of the power spectrum of scalar perturbations to determine 
the value of the Hubble parameter if we know the details of the inflationary model. The best fit for the 
amplitude A s as measured by Planck is A s = (2.196 ± 0.06) • 10 ~ 9 and this fixes the value of rro. We can 
express H in terms of the tensor-to-scalar ratio r as follows 


H 2 


7 T 2 rM 2 A s 
2 


H 


2 • 10 ~ 5 VrM Pl , 


note that we have changed from the reduced Planck mass M p to the Planck mass Mpp In case of chaotic 
inflation therefore we get a Hubble scale of inflation of 


H ~ 10 13 GeH, 


which is about 6 orders of magnitude below the Planck scale. 


The Lyth Bound 


From the concrete example of chaotic inflation with a quadratic potential we see that the inflaton has to 
traverse a large distance in field space A </> ~ 15M p in order to create a fair amount of gravitational waves. 
More quantitatively, we can deduce a relation between the tensor-to-scalar ratio and the evolution of the 
inflaton field by inserting the definition of e into r = 16e 



Assuming we have at least 30 e-folds of slow-roll inflation from the time N = 60 where the observable 
modes cross the Hubble radius we find the Lyth bound [30]: 

Ac/) f 60 /rv ( r* x 1 / 2 

Mp -/ 3 o (oToi) ’ 

where r* is the measured value of the tensor-to-scalar ratio. The Lyth bound provides a lower bound 
of the distance the inflaton moves in field space for a given tensor-to-scalar ratio. This means that if 
the tensor-to-scalar ratio turns out to be large enough to be detectable r > 0 . 01 , then the inflaton will 
traverse a large distance in field space and therefore the inflaton will therefore explore a larger part of the 
potential. In particular if inflation is embedded in a multi-field inflationary model it means the inflaton 
is expected to be more sensitive to the other degrees of freedom. Moreover, we saw in the example of a 
quadratic potential that the Hubble scale will be about 6 orders of magnitude below the Planck scale if 
r > 0.01. If these extra degrees of freedom are much heavier than the Hubble scale, then the dynamics of 
the inflaton might be sensitive to physics only a couple of magnitudes below the Planck scale. 
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2.3 Inflation and observations 


In the previous section we have seen that inflation can resolve the initial homogeneity problem of the Big 
Bang theory. At the same time, inflation also creates tiny initial inhonrogeneities which seed all structure 
in the universe. In this section we explain how precision observations of the CMB provide compelling 
evidence for inflation and moreover how we can use these observations to learn more about the early 
universe. 

2.3.1 Evidence in favor of inflation 

The standard model of cosmology, the ACDM model, is the simplest model which combines the Big Bang 
theory and inflation. It is described by only six free parameters: the density parameters of baryonic matter, 
dark matter and dark energy, the optical depth (which we do not discuss here), the scalar amplitude A s 
and the scalar spectral index n s . Everything else can be derived from these parameters. This simple model 
agrees well with the cosmological data. The Planck collaboration [5] has determined all six parameters to 
high accuracy by measuring the temperature anisotropies. In particular for the inflationary parameters 
Planck has found 

A s = (2.196 ± 0.06) • 10 -9 , and n s = 0.9603 ± 0.0073, 

which are evaluated at the pivot scale A;* = O.OSMpc^ 1 . A scale-invariant power spectrum has been ruled 
out with almost 6a. However, the tensor-to-scalar ratio is assumed to be negligible. Gravitional waves 
contribute to the temperature power spectrum in a specific way and therefore including them in the analy¬ 
sis is expected to weaken the constraints on n s . The Planck collaboration analyzed also an extension of the 
standard model with r as an additional parameter and found n s = 0.962 ± 0.011 with 2a certainty, which 
makes the result very solid. Together with the observation of super-Hubble correlations with coherent 
phases this provides strong evidence in favor of inflation. We will explain these latter statements in words 
in the section on CMB anisotropies below. 

Concerning primordial gravitational waves, Planck has set an upper limit on the tensor-to-scalar ratio 
assuming the ACDM model extended with the tensor-to-scalar ratio and found r < 0.12 with 2a certainty. 
Probing the value of r requires a measurement of the so-called B-mode polarization of the CMB as we 
explain in the section on CMB anisotropies. Recently a detection of primordial B-modes has been reported 
by the BICEP2 collaboration [31] with the best-fit value of 

r = 0.2 ±0.07, 

where r = 0 is disfavored at 7a. If this is a true detection of primordial B-modes, this will be a revolution 
for the paradigm of inflation. There is some tension with the upper bound provided by Planck, but the 
measurement of Planck comes from the temperature power spectrum and is therefore model dependent. 
Moreover, it is noted by the BICEP2 team that accounting for the contribution of foreground dust might 
shift this value down. 

CMB Anisotropies 

The anisotropies of the CMB temperature and polarization can provide very accurate information about 
the matter content of the universe and the initial conditions laid down by inflation'. We will explain what 
information on the cosmological parameters we can extract from the CMB anisotropies and make more 
clear why the observations are in outstanding agreement with inflation. 

The temperature fluctuations of the CMB are described by the power spectrum which contains all infor¬ 
mation about the statistical properties if the fluctuations are Gaussian. The fluctuations are expanded 

7 This subsection is an excert of [32] where everything is explained in much more detail. See also [7]. 
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in spherical harmonics. The power spectrum for a given multipole l is proportional to the absolute value 
squared of the multipole moment of the temperature fluctuation field Q. For low values of l there is an 
absolute limitation to the accuracy of the power spectrum because there are only few samples to take 
the mean of. This is called cosmic variance. Furthermore when the measurement is averaged over bands 
A/ ~ l this leads to another error. For l ~ 100 we already have a very precise measurement if the other 
sources of noise can be suppressed. 

Let’s see what information we can extract from the temperature power spectrum. Before decoupling the 
universe consisted of a tightly coupled photon-baryon plasma. By analyzing the behavior of this cosmo¬ 
logical fluid it is possible to understand the existence of the peaks and troughs and their location. These 
properties depend not only on the matter contents of the universe but also on the initial conditions for 
which a particular form is assumed, namely a nearly scale-invariant power spectrum as predicted by sim¬ 
plest models for inflation. The initial distribution of density perturbations sets the initial conditions for 
the oscillator equation of the photon-baryon fluid. The pressure gradients in the photon density will act 
as a restoring force and any initial perturbation will therefore oscillate with the speed of sound c s . The 
temperature perturbation 0 at the end of recombination in the most idealized case is given by 

rv* 

©(??*) = 0(0) cos (fcs*), with s* = / c s chj. 

JllO 

On the largest scales kc s <C 1 the perturbations are still frozen and therefore this provides a direct mea¬ 
sure of the initial conditions. On the smaller scales the fluctuations evolve until the end of recombination, 
such that some of the modes are catched at their maximum and some at their minimum. These spatial 
inhomogeneities appear as an angular anisotropy today and we see a series of acoustic peaks. The peak 
locations depend very sensitively on the curvature of the universe. The spatial curvature of the universe 
acts like a lens, a positive curvature makes objects look closer than they actually are. The converse holds 
for a negative curvature. The position of the first peak therefore strongly suggests a flat universe. This is 
another breakthrough for inflation since it predicts flatness of the universe. At the same time in order to 
get those peaks there should have been initial fluctuations with super-Hubble correlations with a nearly 
scale invariant spectrum and furthermore perturbations which start to evolve as soon as they enter the 
Hubble radius which uniquely determines the relationship between the phases of the oscillations. Inflation 
explains all these non-trivial observations at the same time. If you would consider an alternative theory 
where curvature perturbations are generated only inside the Hubble radius where the phases of the per¬ 
turbations are arbitrary, the peaks would be washed out. 

The gravitational potential also provides a gravitational force to the oscillator. Gravity compresses the 
fluid and pressure resists. This means we will have both pressure gradients k@ and potential gradients 
k T. In addition one should take into account the redshift of the photons by the gravitational potential. 
In the absence of baryons the oscillator equation remains the same as without gravity, but then for the 
‘effective temperature perturbation’ 0 + T. This is also the measured temperature perturbation since after 
recombination the photons need to climb out of the potential wells and therefore redshift in temperature. 
The initial temperature perturbation can be deduced from the initial gravitational potential perturbation. 
The initial Newtonian potential can be interpreted as a local time shift 5t/t = \k. The temperature of the 
photons goes like the inverse of the scale factor. Therefore, using equation (2.7), the initial temperature 
fluctuation is given by 0 = — |(1 +u;) -ll I / which equals 0 = — \k/2 in the radiation era and 0 = — 2T/3 
in the matter era. This means for a potential well T < 0 the effective temperature is negative which 
means the plasma begins rarefied in gravitational wells. Due to the gravitational potential the fluid will 
get compressed and by the resistance of the pressure it gets rarefied again. This means the first peak 
corresponds to the mode where the fluid exactly has compressed once at recombination. 

When baryons are included they provide extra inertia to the oscillator. The resulting effect can be under¬ 
stood by understanding the oscillation equations as a spring in a gravitational field where the inclusion 
of baryons increases the mass. This leads to enhanced oscillations and a shift of the mean value to 
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© = — (1 + R)^>. Here R is the photon-baryon momentum density ratio. Since we still observe the effec¬ 
tive temperature 0 + 'h only the peaks are enhanced so this has obvious observational consequences. The 
value of R and hence the amount of shifting depends on the baryon density. 

Taking into account the correct matter-to-radiation ratio changes the history of the universe and hence 
the size of the sound horizon at recombination, which is ambiguous with other effects. Fortunately there is 
another distinguishable effect of radiation, namely it makes the gravitational force evolve with time. The 
Poisson equation governs the evolution of the gravitational potential. Because of the radiation pressure 
the background density decreases with time which makes the gravitational potential decay. This induces 
a driving effect, because when the fluid is compressed the gravitational potential has gone and in addition 
the redshifting goes away. As soon as the universe is matter dominated the gravitational potential becomes 
dominated by the behavior of the dark matter. This means the amplitudes of the peaks are increased with 
a decreasing value of the dark matter-to-radiation ratio. In addition, when the gravitational potential is 
destroyed, the baryon loading effect should also be eliminated. Therefore the third peak indicates that 
dark matter exists and that it dominates at recombination. 

Then finally there is another effect which damps the oscillations. It can be thought as a random walk of 
the photons that makes hot regions colder and cold regions hotter. The photons exchange momentum with 
electrons by Thomson scattering and they drag along the protons because of the tight Coulomb coupling. 
Anisotropies of the order of the damping scale or less therefore decrease and the oscillations beyond the 
third peak are suppressed. The damping length depends on the amount of baryonic matter. 

Besides temperature fluctuations in the CMB it is expected that a part of the photons are polarized due 
to Thomson scattering before the surface of last scattering. Polarization is a vector field therefore and 
it turns out to be convenient to decompose it in the so-called Fi-modes and F>-modes [33, 34], which are 
defined as the parity even and odd components of the photon polarization. Scalar perturbations only 
produce Fi-modes and tensor perturbations create both. This means that a detection of F>-modes would 
be an unique signature of primordial gravitational waves. Moreover, the peaks in the Fi-mode spectrum 
are related to the peaks in the temperature spectrum because they originate from the same physics and 
therefore a combined probe provides an important cross check of the assumptions of the underlying model. 
The cross-correlation of the temperature fluctuations and the Fi-modes have been measured and this has 
provided the strongest check of phase coherence of the initial conditions which is in striking agreement 
with canonical single field inflation. 

It should be noted that the observed temperature fluctuations and polarization of the CMB photons also 
carry along information about the physics after recombination. These are called secondary temperature 
and polarization anisotropies. This means we can in addition learn many things about the evolution 
and structure of the universe. On the other hand this limits the precision of the predictions because the 
primordial physics can be degenerate with foreground effects. 

2.3.2 Observing physics beyond the simplest models of inflation 

We have seen that precision observations of the CMB provide compelling evidence in favor of inflation. The 
generic predictions given by single held slow-roll inflation are confirmed by the data. However, if we would 
like to learn more about the precise microphysical origin of inflation it is necessary to deviate from the 
standard assumptions. We summarize the main observables to look for signatures beyond canonical single 
held inflation. Canonical single held inflation is defined as single held slow-roll inflation with canonical 
kinetic terms, minimally coupled to gravity and with Bunch-Davies initial conditions. 
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Deviation from a power law spectrum 

Standard inflation assumes a power law scaling of the power spectrum. A deviation from slow-roll could 
result in a detectable correction to the power spectrum which allows for a change in the tilt 

/ u \ n s -l+^a s ln(fc/fc*) 

The parameter a s is called the spectral running and for slow-roll inflation it is expected to be very small 
because it is second order in the slow-roll parameters. Using the Planck data alone there are no signs of a 
deviation from power law. The BICEP2 result however could point out there is a running to compensate 
for the high value of r, which requires new physics to explain. Another possibility to deviate from a power 
law spectrum is to have features on top of it. This is something we are interested in for this thesis. 

Non- Gaussianity 

The distribution of perturbations is fully characterized by the power spectrum if it is Gaussian. Single 
field inflation predicts a negligible amount of non-Gaussianity 8 and therefore much can be learned from 
a detection of non-Gaussianity, in particular on interactions of the inflaton with other fields. The main 
observable related to non-Gaussianities is the 3-point correlator or its Fourier transform, the bispectrum 

<0| ^.ki^-k 2 ^-k 3 |0> = (27r) 3 B^(fci, k 2 , /c3)e>(ki + k 2 + k 3 ), 

where the delta-function comes from translational invariance of the statistics. The three momenta therefore 
form a triangle and one can consider different shapes if also scale invariance of the bispectrum is assumed. 
The Planck collaboration has tested for different shapes. Local non-Gaussianity corresponds to the limit 
of squeezed triangles and such a detection would rule out all single field models. Planck [35] has put very 
tight constraints on this type of non-Gaussianity and this excludes for example many multi-field models 
with more than one light field. Equilateral non-Gaussianity is constrained by Planck but there is room for 
a future detection. This type of non-Gaussianity can arise in single-field models, so we need a detection 
or more precise constraints to say more. The Planck collaboration has not published the full bispectrum, 
which means that there might be more information in the observational data in case the bispectrum is 
scale dependent. Furthermore, future experiments of large scale structure such as the Euclid mission [36] 
are designed to probe non-Gaussianity to higher accuracy. 

Non-adiabatic perturbations 

Single-field inflation predict only fluctuations in the total energy density of the matter, or equivalently 
only fluctuations in the net curvature perturbation. This is because there is only one field involved. If one 
would allow for multiple light fields it is possible to have fluctuations in the relative densities of the different 
matter components. These type of fluctuations are called non-adiabatic perturbations S. The effect of a 
surviving spectrum of non-adiabatic perturbations Vs on the spectrum of temperature fluctuations is that 
they will result in peaks which are out of phase with the acoustic peaks coming from V-r. By means of 
a detection of the cross-correlation of the E-modes spectrum and the temperature spectrum the relative 
ratio of the power spectrum of non-adiabatic perturbations compared to the power spectrum of adiabatic 
perturbations is strongly constrained by Planck [27] 

—^ < 0.036. 

V-r 

8 In canonical single field inflation, interactions are suppressed by slow-roll parameters and therefore the bispectrum is 
suppressed by a combination of the slow-roll parameters times the square of the power spectrum, which is too small to be 
detectable. 
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How this exactly translates into a constraint on multi-field inflation is model-dependent. Isocurvature 
perturbations present at the end of inflation can get partially converted to curvature perturbations. This 
means that inflationary models which produce isocurvature perturbations are not necessarily ruled out. 
For example, in curvaton models of inflation a significant fraction of the observed adiabatic perturbations 
could originate from isocurvature perturbations. This is discussed in section 10.3.2 of [27]. Moreover, the 
evolution of the non-adiabatic perturbations depends presumably on the physics after inflation. 


Tensor tilt 

If future polarization experiments can confirm the detection of a tensor-to-scalar ratio of order r ~ 0(0.1) 
then there is hope to detect the tensor spectral index as well from the temperature power spectrum. In 
single field slow-roll inflation we have the following relation between the tensor tilt and the tensor-to-scalar 
ratio 

r 

n t = 


such that a disagreement with this relation would also be a hint of new physics. 


24 



Chapter 3 


Multiple Field Inflation: a Translation of 
Different Studies in Literature 

3.1 Introduction 

In the previous chapter we have seen that inflation can explain why the universe is so homogeneous and 
flat and that it can account for tiny primordial density fluctuations which are the seeds of all structure in 
the universe. Moreover, the current observations of the temperature and polarization anisotropies of the 
CMB provide compelling evidence in favor of inflation [27, 31]. Now we have very precise measurements 
of the CMB and hints that inflation took place only a few orders of magnitude below the Planck scale, it 
is interesting to investigate if we can learn more about fundamental physics beyond the standard model 
by using the observational data. Since inflation is formulated in a field theoretical framework it is in 
particular possible to study models motivated by supergravity and string theory. These models typically 
consist of multiple scalar fields [16]. This means that the dynamics of the inflaton could have been 
influenced by the other degrees of freedom and therefore it is highly interesting to explore observational 
distinguishable effects compared to canonical single field inflation 1 . For this we need to have a good 
theoretical understanding of multiple field inflation. In this chapter we provide an overview of a part of 
the literature on multiple field inflation. The studies use different notations and definitions which can give 
rise to much confusion when one would like to understand which approximations are precisely made. In 
the first part we recap the general framework of inflation and discuss the slow roll and turn parameters 
while we provide dictionaries which translate between the different notations and definitions used in the 
literature. In the second part the different approximations and their regimes of validity are discussed. An 
overview of the literature studied can be found in table 3.1. The shortcuts listed in this table are used in 
the dictionaries to denote the corresponding part of the literature. 


1 Canonical single field inflation is defined as single field slow-roll inflation with canonical kinetic terms, minimally coupled 
to gravity and with Bunch-Davies initial conditions. 
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Authors 

Papers 

Shortcut 

Gordon, Wands, Bassett and Maartens 

[37] 

GW 

Groot Nibbelink and van Tent 

[38] 

TG 

Cremonini, Lalak and Turzynski 

[39, 40] 

CT 

Achucarro, Atal, Cespedes, Gong, Hardeman, Palma and Patil 

[41, 42, 43] 

AP 

Peterson and Tegmark 

[44, 45] 

PT 


Table 3.1: An overview of the main literature studied in this chapter. The shortcuts are used in the dictionaries 
to denote the corresponding part of the literature. 


3.2 Framework of multiple field inflation 

In this section we recap the general formalism of multiple field inflation with Einstein gravity. We fix our 
notation and give the translation to notation used in the selected papers in table 3.4. 


3.2.1 Set-up 

We study multiple field inflation in which the matter content of the universe is described by an arbitrary 
number of fields <fi a with possible non-canonical kinetic terms which are minimally coupled to gravity. The 
full theory is given by the following action 


S= d A x\f^g 


A/f 2 1 

_Ti? _ -g^GMd^ducf ~ V(r 


(3.1) 


where g is the determinant of g^ w , Mp is the reduced Planck mass and R the Ricci scalar constructed from 
spacetime quantities. Latin indices are used to denote the scalar fields which are at the same time treated 
as coordinates on the field manifold they span. Greek indices are used for the usual spacetime coordinates. 
We assume the universe is homogeneous and isotropic on large scales and spatially flat, which means that 
the background is described by the Friedmann-Robertson-Walker metric 

ds 2 = —dt 2 + a 2 (t)6ijdx l dad = g^dx^dx". (3.2) 


The matrix G a b describing the non standard kinetic terms is symmetric, positive-definite and smooth and 
therefore can be interpreted as a field metric belonging to a smooth field manifold. The potential V is 
continuous and bounded from below, but besides that completely arbitrary. 


3.2.2 Covariant formalism 

In order to derive compact and held coordinate invariant equations describing the evolution of the clas¬ 
sical background and the quantum fluctuations we use the covariant formalism with respect to the held 
manifold developed in [38]. Moreover, to understand the properties of the background held trajectory it is 
convenient to change basis in held space by defining directions along and perpendicular to the background 
trajectory hrst introduced in [37]. 

We rewrite the equations manifestly in tensor form to make all the equations held coordinate invariant. 
We use G a b an d its inverse G ab for the shorthand notation to lower or raise indices when we take dot 
products. Furthermore since d a A b '" c for some arbitrary tensor A does in general not transform like a 
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tensor, we need to define a covariant derivative V a such that when applied on tensors it will return another 
tensor 

V Ab- _ a Ab... I pfe AX... _ pa; a b... , 

v c... — u a si- c ~r 1 ax s\ c 1 ac sl x i • ■ ■ • 

The connection coefficients T£ c are chosen as 

r a bc = l -G ad {d b G cd + d c G db - d d G bc ), 

such that the covariant derivative is metric compatible. Furthermore we can also define a covariant 
derivative on the field manifold with respect to any of the spacetime coordinates 

= (c^ a )V a , 

which still creates tensorial quantities because <j) a transforms as a scalar under spacetime coordinate 
transformations. Finally all information of the curvature of the field manifold is contained in the Riemann 
tensor 

pa _ c\ pa pa . pa px pa p# 

^ bed bd ®d*- be'*- ex*- bd *- dx*- be ? 

which can be contracted to the Ricci tensor R ab = R x aXb and the Ricci scalar R = R a a . 


Let us now study the basis induced by an inflationary trajectory. Consider an arbitrary curve in field 
space parameterized as 4> a (t). This allows us to define a proper field distance along the curve given by 


cr = 


G a b(t> a 4> b dt. 


which is strictly increasing in time for a quasi de Sitter inflationary trajectory. This defines a field speed 
along the curve 

d = \J G ab <fi a <j) b . 

If we use (T as parameter along the curve we get the following tangent vector along the curve 



which is automatically normalized. We can define the directional covariant derivative 


(3.3) 


d(b a 

D a = --V a , 
da 


which can be used to define parallel transport of tensors along this curve as having a vanishing directional 
covariant derivative. Geodesics in field space can be defined as paths which parallel transport their own 
tangent vector 

D a T a = 0 -> D t T a = 0. 

We know the evolution of the fields will be along geodesics unless a force pushes them off. This role is 
fulfilled by the potential term in the action. If minus the gradient does not point in the same direction 
as tangent vector of the field trajectory, it will deviate from a geodesic. Therefore it is useful to define a 
normal unit vector to the trajectory as follows 


N a = s N (t) (G bc D t T b D t T c ) 


1/2 


D t T 


a 


(3.4) 


which is only well defined when D{T a is nonzero. This is the reason the function sjv(t) = ±1 is intro¬ 
duced 2 , which indicates the orientation of N a with respect to D t T a . We let sjv(i) just take some initial 

2 As pointed out in [41]. 
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value at a given time where DfT a is nonzero such that we know the initial orientation of N a . During the 
intervals where D b T a vanishes N a will be defined such that it is a continuous function of t , furthermore 
S]\r(t ) may flip sign such that when D^T a becomes nonzero again, N a stays a continuous function of t. 
This corresponds to a right-handed or left-handed orientation in the two dimensional case. The rate of 
change of the tangent unit vector T a can therefore be in both the +N a direction and in the — N a direction 
and will be proportional to the gradient of the potential in the normal direction. 


In this chapter we try to keep our expressions as concise as possible, which means we will introduce 
some more simplifying notation, likewise as done in the papers we study in this overview. In particular, 
a potential source of confusion is the notation used for the gradient and higher order derivatives of the 
potential and its projection along the tangent and normal directions. For clarity, we fix our notation here. 
The gradient and Hessian of the potential are denoted by 

v a = v a v, v ab = v a v b v. 

In case of two fields, the projections of the gradient and the Hessian of the potential along the tangent 
vector T a and the normal vector N a are written as 

V T = T a V a , V N = N a V a , and V TT = T a T b V ab , V TN = V NT = T a N b V ab , V NN = N a N b V ab . 

We will use similar notation if we project other quantities, which have not been defined yet, along the 
tangent and normal directions by using the indices T and N. 


As a final note, the equations are given as function of coordinate time t, but one could equivalently use 
another time variable. Two particularly useful time variables in the context of inflation are conformal 
time r as defined by the relation dt = adr and number of e-folds of inflation N as given by Hdt = dN. 
We don’t work with a special time coordinate but interchange between t, r and N depending on which 
variable is most convenient. Derivatives with respect to these time variables are denoted as 


d_ 

dt 


c..o = (.:.), 




A dot, a prime and a semicolon denote derivatives with respect to time t, conformal time r and number 
of e-folds N respectively. 


In the rest of this section we first explain how to derive the equations describing the evolution of the 
classical background and introduce the slow-roll parameters and the notion of turns. Then we continue 
with the perturbations equations and their quantization for any number of fields. In the end we simplify to 
the case of two fields and we project the perturbations equations along the tangent and normal directions 
to study the influence of the background dynamics on the evolution of the perturbations. We provide 
dictionaries to translate between the different notations and definitions used in literature along the way. 


3.2.3 Background equations and slow-roll and turn parameters 

Assuming the cosmological principle and that the universe is spatially flat, we can use the spacetime 
metric defined in equation (3.2). Minimizing the action yields the homogeneous equations of motion for 
the scalar fields and the Friedmann equations, 


M a + 3 H<\> a + V a = 0, 


H 2 = 


1 


3Mp 


& 2 + V 


H = — 


a 


2Mp ’ 


(3.5) 
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where H = a/a is the Hubble parameter V a a short-hand notation for the gradient of the potential 
G ab \7bV. In order to solve the equations of motion for the perturbations later on, we need to understand 
how the perturbations are influenced the background dynamics. Based on our intuition from single field 
inflation we can use a slow-roll approximation to simplify the equations. Since the inflaton is in general 
not moving in a single field direction it is convenient to change basis by using the unit vectors pointing 
along and perpendicular to the background trajectory as defined in section 3.2.2. This allows us to define 
slow-roll parameters in multiple held inflation. Furthermore the trajectory can be forced to deviate from 
a geodesic in held space by the shape of the potential and therefore the tangent and perpendicular basis 
vectors may rotate into each other. This can be parameterized by so called turn parameters. We now 
make the notion of slow-roll and turns more precise and compare our dehnitions with the dehnitions used 
in the literature. 


Geometric slow-roll parameters and turn parameters 

We project the background equations of motion (3.5) along the tangent T a and normal direction N a and 
hnd 

(3.6) 

with V a = X7 a V = VrT a + Vi\rN a . The first equation coincides with the equations of motion for single 
held inflation. The second equation shows that the inhationary trajectory deviates from a geodesic when 
the normal component of the gradient of the potential is nonzero. This motivates the following dehnitions 


H 

(3.7) 

* a = ~Wa D ^ 


which can be divided into the slow roll parameters e and rfi 


1! arri & 

(3.8) 

and the turn parameter r] 1 

^ 3 3 jk- 

(3.9) 

Only if the background trajectory is a geodesic of held space then D t T a 
the deviation of the inhationary trajectory from a geodesic. For later 
parameter 

= 0 and therefore r} 1 parameterizes 
use we also define a third slow roll 

= 

(3.10) 

and a second turn parameter 


P ± _ ^ 

^ Hr] 1 - ’ 

(3.11) 


which are only well defined if both a and r/ 1 - are nonzero, but in all the equations they will appear as 
and anyway so we do not need to worry about this. Note that the definition of the normal vector 

N a implies the following signatures 

sgn(??- L (t)) = - s N (t ) ->• sgn(r/ ± (f)IV a ) = -1, 

which means that for any equation the number of r/- L, s and N a, s in front of each term should be all odd 
or even. 
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The slow roll approximation can now be defined as e,r^,£^ <C 1. The first two approximations e,r^ <C 1 
represent that the potential is flat in the tangent direction, such that the held rolls slowly down the 
potential. To see this rewrite the first order slow roll approximation as 


€ < 1 
7^ <C 1 

e" < i 


d 2 « V 


Vt 

T < V 
9 H 2 


3 (?/^ +1 


H 2 


Vt 


V 

3 M 2 ’ 


[Mi 


Vt 

V 


< 1, 


H 2 


a 


9 T a V a V T 
3 M 2 p -< 1, 


(3.12) 


with Vt = T a X7 a V , which is the projection of the gradient of the potential along the tangent direction. 
The approximation <C 1 assures the hrst two approximations will be prolonged because it says that the 
held acceleration a does not change much during an expansion time. Similarly we can define the slow-turn 
approximation t/ 2 ,^ <C 1, but note that this approximation has nothing to do with facilitating quasi de 
Sitter inhation as the slow-roll approximation does. Moreover, if the slow-turn approximation is valid it 
will suppress the multi-held effects, because they are parameterized by r/ 2 . 


Kinematical slow-roll parameters 

Equivalently one can use a kinematical definition of the slow roll parameters in order to define various 
orders of the slow roll approximation. This is in particular useful if e changes adiabatically during the 
times where the observable modes cross the Hubble radius. In this case one can construct a derivative 
expansion of e where the corrections will be suppressed. The hrst slow roll parameter e parameterizes the 
deviation from the de Sitter inhation such that the zeroth order slow roll approximation e = 0 is equivalent 
to the de Sitter inhation. Since inhation should end this parameter is in general nonzero and at the end of 
inhation it becomes of order 1. Many e-folds before the end of inhation we expect it is possible to assume 
the hrst order slow-roll approximation e <C 1 and e ~ const., which means we assume quasi-exponential 
inhation, i.e. the Hubble parameter does not change much during one expansion time. The second order 
slow roll approximation allows e to vary in time but assures the quasi-de Sitter regime will be prolonged. 
We extend the same reasoning to the third order slow-roll approximation. This motivates the alternative 
definition of the slow roll parameters 


tH = 

VH = 
^H = 


H 

H 2 ~ e ' 
ch 


2 Hen 

m 

Him 


= r/" - e, 

7/11 (£ll — 7/11 


- 3e) + 2e 2 


7/ii — e 


(3.13) 


where the index H belongs to the kinematical slow-roll parameters. We expressed the kinematical slow 
roll parameters in terms of the geometric ones to be able to easily interchange them. From the expressions 
in terms of the held speed and its accelerations it seems that, when the background and perturbation 
equations are expressed as function of number e-folds, these slow roll parameters will appear naturally, 
therefore we will choose to work with these parameters later on. Similarly to the various orders of the 
slow roll approximation we can define a zeroth, hrst and second order slow turn approximation. We list 
all order slow roll approximations in table 3.2. 


Slow-Roll-Slow-Turn (SRST) approximation 

The SRST approximation is dehned as taking both the slow roll and the slow turn approximation [44]. 
We dehne the hrst (or second order) SRST approximation as a combination of the hrst (second) order 
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Slow roll approximation 

Neglect 

Slow turn approximation 

General meaning 
During transition 

e, t? 11 , C 11 « 1 or e H ,V h,£h < 1 

No prescription 

T] P , <C 1 

Zeroth order 

£H = VH = £,H = 0 

0(e H , tjh,^h) 

7] 1 =£-*-=() 

First order 

e# 1 and rjn = £h = 0 


1 and £-*- = () 

Second order 

\vh\ < e H < 1 and i H = 0 

0(e 3 H ,e H m,VH^H) 

k-H < i T/ 2 - 1 < i 


Table 3.2: Summary of the slow roll and slow turn approximations introduced in section 3.2.3. The first row gives 
the slow-roll and the slow-turn approximation in the general sense. The second row states the approximations at 
various orders, which can be used when one is considering a period consisting of only a couple of expansion times, 
such as the transition from the sub-Hubble to the super-Hubble regime. 


slow-roll and the first (second) order slow-turn approximation. Within this approximation we 
the non-geometric slow roll parameters and slow turn parameters in terms of the gradient and 
of the potential. Assuming <C 1 we get the following SRST approximation of the 

equations of motion for the fields 


3 (j)’ a + 


V a 

W 


0, 


can express 
the Hessian 
background 


where the semicolon denotes a derivative with respect to the number of e-folds N and where we denoted 
V a = X7 a V. This yields the following expressions 


(l) _ m p (Yl 


e v ' = 


Vh } = M 2 P 


V 
I Vtt 


Vr\ 

V 


-L(!) _ ?\2 ( V NT V N V T \ 

- Mp {~v v^J- 


(3.14) 


Here we used the notation V TT = T a T b V a V b V , V TN = V NT = T a N b V a V b V and V NN = N a N b V a V b V. 
Plugging these expressions into the background equations we get the following improved expressions 


Ih = df - m p (ys 


VtVtt\ V ± Vtn + Vj-'j-' 

) V 


p 2> = d 


.(1) 


— Mp 2u ; 


(V$V N 
^ V 3 


VtVnt\ + Vtt\ 

v 2 ) v J ' 


(3.15) 


This allows us to express the observables in terms of the original model defined by the potential and 
field metric. The tangent and normal vector are however not determined without solving the background 
equations. 


We notice that the SRST approximation implies that the gradient squared MpViV b /V 2 and the Hessian 
of the potential MlV b V a /V, except for MIVnn/V, are small along the inflationary trajectory. We expect 
that a converse relation also holds. If one assumes the slow roll approximation and if the full Hessian of the 
potential is small, we expect that the turn parameters are of the same order as the slow roll parameters. 
We did not manage to find a solid proof, but presumably the argument goes as follows. The slow roll 

approximation implies that ^ is small along the trajectory. Assuming there is one point along 

the trajectory where the turn rate vanishes such that Rv = 0 we expect that the smallness of the Hessian 
implies that Vjy can only increase by Vnt times the distance along the trajectory by the estimate 

V N « [ dcf N a \7 b V a = [ da V NT - 
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A more physical argument is that we expect that a very flat potential cannot force the inflationary 
trajectory to deviate from a geodesic too much. 


Useful expressions 


We derive a couple of useful expressions which we often use for manipulating the equations. First of all the 
derivatives of e and can be used to convert from the geometric to the kinematical slow roll parameters 

e ; = 2e(e-?? 11 ), 

( 7] \\y = -jyll^ll -e-7/ 11 ), 


which gives in particular 

?? ll^ll = r) H £ H + + 5e) + 2e 2 . 

Next we can express the slow-roll parameters in terms of the field speed and its accelerations 


e = 


(T? 
2Mp ’ 


o" a”' o" 

VH = -r, iff =-T + — 

a- o'” O' 


With the following expression we can easily convert the Hubble parameter to the potential and vice versa. 


V = (3 - e)MpH 2 . 

Finally during the transition regime we can express T~L and — explicitly in conformal time. We assume the 
second order slow-roll approximation and a Taylor expansion of e around Hubble radius crossing yields 


e = e* — ke*rj H *(T - r*), 


where t* is the time of Hubble radius crossing for a given mode k and e* the value of e at that time. 
We can neglect e*?/#* in the second order slow-roll approximation (see section 3.2.3 and Table 3.2) and 
therefore we can derive 


w 


(1 - e)H 2 


with 


T~i ~ —(1 + e* + e 2 ) 

T 

/cr* ~ —(1 + e* + e 2 ), 


and 


a 

a 


2 + 3e* + 4e 2 


(3.16) 


where we fixed the value of the conformal time at Hubble radius crossing for each mode k in order to keep 
the expressions as simple as possible. The first order slow-roll approximation renders the same expressions 
but then up to linear order in e. 


Dictionary slow-roll and turn parameters 

Finally we compare our slow-roll parameters with the ones used in the other papers in Table 3.3. Note 
that the slow-roll parameters used in the papers of TG and PT are ‘contaminated’ with turn parameters. 
This means their slow-roll approximation implicitly makes use of the slow-turn approximation. Their 
parameters can therefore only be used in the SRST approximation. In the more general case where one 
would like to study the influence of turns, one could use one of the sets of slow-roll parameters and the 
turn parameters defined section 3.2.3. These parameters allow one to study the slow-roll contributions 
and the contributions from turns in the trajectory separately. This is important in order to understand 
the observational signatures of multi-field inflation. 

3.2.4 Dictionary background equations 

A translation of all notation introduced so far to the notation used in the other papers can be found in Table 
3.4. In this dictionary we exclude the slow-roll and turn parameters, since these are already translated in 
Table 3.3. Besides the slow-roll and turn parameters, the definitions of the quantities introduced so far 
to describe the set-up of multiple field theory are equivalent in all papers. This means the dictionary is a 
pure translation of symbols. 
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Quantity 

Here 

AP 

TG 

GW 

CT 

PT 

Slow-roll parame- 

e or e H 

e 

e 


e 

e 

ters 

77II or r] H 

V\\ 

fj\\ = - v \\ 

_ 

_ 

771 = ~a'r] H 


^ or £ h 

£|| 

fll = — T ]- 1 ' 2 + 

- 

- 

Ci = ~ <T ’((.V ± ) 2 + VhZh + Vh) 

Turn parameters 

77- L 

Vx 

fj 1 - = —T] 1 - 


- 

m = -CT'T] 1 - 



£1 

f 1 = rj L {e + 277II + £-*-) 


- 

6 = a'r] ± ( 2 r] H + ^) 









Table 3.3: Comparison of the slow-roll parameters defined here and the ones used in the other papers. The 
abbreviations used in the first row are explained in Table 3.1. When there is a difference in definition we indicate 
this in blue and with an underline. 


3.2.5 Perturbation theory 

Ultimately we would like to compare the predictions from multi-field inflation with the small anisotropies 
in the cosmic microwave background radiation. The background fields are responsible for a spatially 
homogeneous and isotropic universe. Therefore we have to study the behavior of arbitrary quantum 
fluctuations with respect to the classical spacetime background and fields. Assuming they are small we can 
expand all equations to linear order in the perturbations. The observed temperature fluctuations can be 
related to these quantum perturbations. By assumption the expectation value of the perturbations vanishes 
therefore we can characterize the perturbations by the Fourier transform of the two-point correlation 
function which is a measure of the variance of their distribution. In this subsection we explain how 
to derive the equations of motion for the perturbations and in section 3.2.6 they will be quantized and 
related to observables. There will be overlap with chapter 2 section 2.2.4, in particular when we discuss 
the quantization, but we repeat this in order to have a consistent chapter. 

Cosmological perturbation theory 

Because we treat the perturbations to linear order we can therefore use standard cosmological perturbation 
theory, see for instance [?]. Using the spatial translation symmetry of the background it follows that 
at linear order the Fourier modes of the perturbations evolve independently. Moreover because of the 
spatial rotation symmetry of the background one can derive that the modes with different helicity evolve 
independently at linear order as well. This means the real perturbation of the metric can be divided into 
scalar, vector and tensor perturbations, which correspond respectively to modes with helicity 0, ±1 and 
±2. The most general way to parameterize linear scalar perturbations is then given by 

ds 2 = —(1 + 2 4>)dt 2 + 2 a(t)diBdx l dt + a 2 (f)[(l — 2'$)8 l j + 2didj E]dx l dx\ 
cj) a (t, x) = cj) a (t ) + 5cj) a (t, x). 

Note there are fewer true degrees of freedom because we still need to incorporate the gauge freedom and 
the Einstein equations. The scalar gauge transformations 

t->t + f, x l -X E + d l £, 

remove two degrees of freedom. The Einstein energy and momentum constraint equations set in addition 
two more degrees of freedom to zero. This means we are left with n scalar degrees of freedom, where n is 
the number of scalar fields in our multiple field model. For the actual computation we would like to fix a 
particular gauge, therefore we should take care that we are only considering physical quantities. It turns 
out to be useful to work with the following gauge invariant variables 

Q a = 5(t> a + 4 *, 

H 
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Quantity 

Notation 

here 

Notation 

AP 

Notation 

TG 

Notation 

GW 

Notation 

CT 

Notation 

PT 

Spacetime metric 

57 w 

9fj,u 


9biv 

9fj,u 

9fj,i' 

Spacetime derivative 

d n 

d n 

d„ 

d,d 

d,d 

d 
dx a* 

Cov. derivative on spacetime 




- 

- 

- 

Conformal time 

V 

T 

V 

- 

r 

- 

Comoving time 

t 

t 

t 

t 

t 

t 

Number of e-folds 

N 

N 

N 

- 

N 

N 

Derivative w.r.t. r/ 

(••)' 

(••)' 

(••)' 

- 

(••)' 

- 

Derivative w.r.t. t 

(■•) 

(■•) 

(■■) 

(■■) 

(■■) 

(■■) 

Derivative w.r.t. N 


- 

- 

- 

- 

(■■)' 

Field metric 

G ab 

'Yab 

Gab 

_ 

- 

Gij 

Field derivative 

da 

9a 

9a 

- 

(■■)* 

- 

Metric compatible connection 

T bc 

r l 

r l 

- 

- 


Riemann tensor 

Dfl 

11 bed 

Kc d 

pa 

11 bed 

- 

- 


Cov. derivative on M. 

V a 

V a 

V a 

- 

- 

Vi 

Cov. der. on A4 w.r.t. 

D, 

D 
dx p 


- 

- 

D 

dx^ 

Field coordinates 

0 a 


0 a 

Vi 

X,0 

0 i 

Field velocity vector 

0 a 

fo 

<t> a 

Vi 

X,0 

0 l 

Field speed 

a 

00 

\0\ 

a 

a 

Hv 

Field acceleration 

a 

00 

\0\ 

a 

<7 

H{Hv)' 

Field-gradient potential 

V a 

V a 

V a P 

v VI 

V x ,vp 

ViP 

Double gradient potential 

V b P„ 

V bVa 

V ft V a V 


V xx etc. 

V,-ViP 

Tangent direction 

rpa 

rpa 

e l 

- 

K 

e i 

Normal direction 

N a 

N a 


- 

El 

e 2 

Projection of V a along T a 

V T 

v<t> 

- 

Pa 

Va 

- 

Projection of V a along N a 

V n 

V N 

- 

Ps 

V s 

- 


Table 3.4: This dictionary provides a translation of all notation introduced up to section 3.2.5 to the notation used 
in the other papers. This concerns notation to describe the set-up of the multiple field theory and the background 
equations. The abbreviations used in the first row are explained in table 3.1. 


or equivalently the generalized Mukhanov-Sasaki variables q a = aQ a . By using the basis induced by the 
inflaton trajectory we can separate Q a into perturbations Q T along the trajectory and n— 1 perturbations 
orthogonal to the trajectory of which Q N is one. Adiabatic or curvature perturbations proportional to Q T 
correspond to perturbations in the total energy density or to the curvature of equal time hypersurfaces. The 
entropy or isocurvature perturbations (proportional to Q N and the other components of Q a ) correspond to 
relative perturbations in the energy densities of the different fields while leaving the total energy density 
and curvature of the spatial slices unchanged. In the end we will compute the two point correlation 
function of the curvature perturbation on uniform density hypersurfaces 


n=-Q 

a 


T 


which has the important property to be conserved on super-Hubble scales k <C aH when the isocurvature 
perturbations are negligible. If that is the case it means that the curvature modes can be evaluated 
after they are well outside the Hubble radius and they will only evolve in time when they enter the 
Hubble radius again at late times where the physics determining their dynamics is well understood. What 
we precisely mean by entering the Hubble radius will be explained after we derived the perturbation 
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equations. Next one can fix a gauge to simplify the calculations. Some popular gauges are the Newtonian 
gauge (B = E = 0), the uniform density gauge (dp = E = 0 such that 1Z = 4/, which explains its name) 
and the comoving gauge (dq = E = 0 such that 1Z = 4/ and therefore it is also called the comoving 
curvature perturbation). 


Perturbation equations 

Computing the Einstein equations and the equations of motion of the perturbed fields leads to the following 
linearized equations of motion for the gauge-invariant perturbations. 


D 2 t Q a + 3 HD t Q a - ^Q a + ^Q b = 0, 
a z a z 

= V b V a - a 2 R a cdb T c T d +2e— ( T a V b + T b V a ) 
or n - v ---' <J 

’Mass matrix’ ( M 2 ) a b 

In conformal time these equations simplify by using the Mukhanov- 

D 2 q a + (-v 2 r b + n\) q b = 0, 

with 

= C\ - (2 - e)^ 2 n, 

with H = aH. Besides the scalar perturbations we also have tensor perturbations. Since the extra fields 
do not provide any additional tensor perturbations compared to canonical single field inflation, we can 
get exactly the same equations as we found in section 2.2.4. Because we will be working in the slow-roll 
regime this means it reduces to the single field slow roll expression for the power spectrum of the tensor 
perturbations. Expressed in terms of the slow-roll parameters, the tensor power spectrum is the same for 
every multi-field model. The definition of the power spectrum and the solution for tensor perturbations 
are given in section 3.2.6. Going back to the scalar perturbations we can divide the evolution of the 
perturbations roughly in three regimes. This can be most easily seen when the equations are expressed in 
terms of e-folds because then the coefficients are all dimensionless 


+ 2e(3 - e)H 2 T a T b . 


(3.17) 


Sasaki variables: 


(3.18) 

(3.19) 


D z N Q a + 3- 


G- 


2Mp 


D N Q a + 


k 2 


a 2 H 2 


Q a + 


C a 


a 2 H 2 


Q b = 0. 


Here D jv is the covariant derivative on the field manifold with respect to the number of e-folds N, which is 
not equal to N a X7 a . Note we are studying Fourier modes of the perturbations with comoving wavevector 
k. During inflation aH will increase quasi-exponentially with N, where 1 /aH is the so-called Hubble 
radius. This means we can identify three scales of interest, sub-Hubble scales k 3> aH , transition scales 
k ~ aH and super-Hubble scales k <C aH. At sub-Hubble scales the coefficient a ^ H - 2 dominates over the 
other coefficients, which means the differential equations decouple and we can solve them exactly. One 
has to be careful though with additional mass scales which may introduce another regime of interest. At 
super-Hubble scales we can neglect Jh 2 which allows us in certain cases to estimate the behavior of the 
solutions. During the transition region the curvature modes under consideration cross the Hubble radius 
and get frozen if the isocurvature modes are small enough. We will go into more detail in solving these 
equations in section 3.3, but first we will discuss the quantization of the perturbations in section 3.2.6 
and express the coefficients of the perturbations equations in terms of the slow-roll and turn parameters 
in section 3.2.7. 


3.2.6 Quantization of perturbations 

We use canonical quantization to quantize the perturbation fields. 
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Canonical coordinates and momenta of q a in any frame 

The equations of motion for q a can be expressed in any frame, given by a set of vielbeins {e J a }. In 
this section we stay with a general frame but keep in mind that there is a preferred frame given by the 
tangent and normal directions defined by the inflationary trajectory {T a , N a , ...}. Defining q 1 = e 1 a q a the 
equations of motion become: 

q 1 " + 2 Z I J q J ' + (Z7 + Z J K Zf) q J + (-V 2 l5 + Qj) q J = 0, (3.20) 

with 

Z 1 j = e 1 a (D T e a j) and = e{^e b j. (3.21) 

The equations of motion for q 1 correspond to the following Lagrangian density 

£ = \ [W + Z 1 jq J ) 2 - (V) 2 - q^ijq’ 7 ] , 

where the overall time-independent factor is fixed by the limiting single field case, see for instance [15]. 
The Lagrangian has no canonical kinetic terms, but that can be resolved by a redefinition of the fields 

q 1 = Rj(T)q J where R satisfies R 1 J = —Z^R 1 } with Rj{Ti) = I j for some initial time r t . 

Now the equations of motion for q 1 reduce to: 

q 1 " + (—V 2 Ij + (R^nR) 1 ;) q J = 0, (3.22) 

and the Lagrangian density becomes 

C = \ [( q 7/ ) 2 - (Vg 7 ) 2 - q I (BT 1 tlR) I jq J ] , 

which has standard kinetic terms. Therefore we know how to quantize the canonical coordinate and 
momentum fields q 1 and i t 1 = q 1 '. We promote them to operators and impose the standard equal-time 
commutation relations. From this we can derive that the pair {q 1 , ir 1 = q Ir + Z^q 7 } are the corresponding 
coordinate and momentum fields in the non-canonical frame. 

q 1 —> q 1 , 7 t 1 — > n 1 , q 1 —> q 1 , ir 1 —> n 1 , 

[g 7 (r,x),# J (r,y)] = i5 IJ 5{x - y), hence [q 1 (r, x), tt‘ J ( r, y)] = R^RlI^ (t,x), # l (t, y)] = i5 IJ 5{x - y). 

(3.23) 


Creation and annihilation operators and initial conditions 

Before we quantize the theory, we rewrite the solution to the equations of motion for q 1 and tt 1 in a 
convenient way. Expanding q a in terms of its spatial Fourier modes q 



d 3 k 

(27t) 3 / 2 


Qk(T)e 


ik x 


the equations of motion become 


( lk + 2 Z 1 jtfH + ( Z + Z t k Zj ) + (k% + Qj) q£ — 0. (3.24) 

The solutions of the equations of motion for q £ are given by complex mode functions (. q T k ) a (& n d (7r^) a = 
(qDcx + Zj(q£) a ) , where a runs from 1 to n. Each a represents a set of solutions {(q[) a , (q {*) a } (possibly 
zero) such that the general solution of q £ is determined by a constant complex vector a^: 


qL = [(«fc)a( a k)a + (9fc*)a(a-k)a] 

a 
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and the solution for 7r k becomes 


*k = Y [( 7r Jfc)a(°k)« + ( 7r fe*)a(°-k)a] • 

a 

Note that the reality condition g k *(r) = g£ k (r) is manifestly satisfied. After promoting the vectors 

(ak)a and their complex conjugates to the operators («k)a and (d k ) a , we would like to interpret them as 
annihilation and creation operators respectively, which satisfy the usual commutation relations 

[(fik)a, («q)/?] = da/ 3 <^(k - q). (3.25) 

These are consistent with the commutation relations (3.23) if and only if the mode functions are normalized 
as follows: 

Y [(dietin',() a - (?Tfc*)a(gfc)a] = i5 U , 

a 

Y [(9fc)a(«fc*)a “ (9ifc)a(9fc*)a] = 0, (3.26) 

a 

Y [( 7r fc)«( 7r fc*)« - ( 7r fe)a( 7r f )a] = °- 

a 

Another constraint on the mode functions comes from imposing initial conditions. 

Assumption 1. At the beginning of inflation the initial state of the universe is the vacuum state |0), 
defined by d J k |0) = 0 for all scales k of interest, such that there is no initial particle production, i.e. the 
Bunch-Davies vacuum. 


Since we are only interested in the observable modes which are sub-Hubble at the beginning of inflation 
(UV modes) and we are working in the linear approximation where different modes do not interact, this 
statement implies that we assume an initial state of minimal energy for these UV modes 3 . Therefore the 
Hamiltonian does not contain any terms proportional to aa and a) a ). After some algebra the Hamiltonian 
can be written as: 


H = J d 3 x 
1 


Y^q 1 ’) ~ C 


E d k (T).) Q ,^(a k ) 0 ,(a^ k ) ( g + (Gfc)a/3(®k)o(®k)/3 T h.c. 


a/3 


with _ 

( F k)afi = X! [( 7r fc)«( 7r fc)/3 - 2 ( 7r fc)a^j(^fc)/3 + Wk) ^ (q {)0 + {q{)a^ j(<?fc )fi] , 

I 

( Gk)a/3 = Y] [( 7r fc)a( 7r fc*)/9 — 2 (^fc)aZj(<7fc*)/3 + {Qk)ak 2 {q{*)p + (<7fc)a^j(<?fc*)y8] • 
I 

The energy of the ground state is given by 


H l°> = 2 


I ' 


d 3 kh 3 (0) Y [( 7r fc)«( 7r fc*)« + k 2 {ql)a(ql*) a ~ 2(q I k ) a Z I J (ir£*) a + (?*)afij(9k*)a . 


Here we interpret h(0) as the infinite total volume of space 4 V and we factor it out by demanding that the 
energy density (0| H |0) jV is minimized. Therefore at the beginning of inflation to, which is assumed to 

3 This means we do not specify the initial conditions for the infrared modes k aH. It is however possible to define a 
global Bunch-Davies vacuum following the lines of [46]. 

4 See [47] p.72. 
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be deep inside the sub-Hubble regime where we can assume k Zj and k 2 flj, we have in addition 
to (3.26) the following constraints: 


EE [l( 7r fc)a| 2 + ^ 2 |(?i)a| 2 ] is minimized 

' “ (3.27) 

X! [( 7r fc)«( 7r fc)/3 + ^(Qk)M)p\ = 0 

i 

Writing the sub-Hubble solution to the equations of motion for q T a as 

q I a = a I a e ikT + b I a e- ikT . 

and denoting a ! a and b J a as the matrices A and B, we can rewrite the constraints (3.26, 3.27) as the 
following matrix equations 

A*A t + A* t A - (B*B t + B* t B) = -I, 

k 

ab* t = ( ab* t ) t , ba* t = ( ba* t ) t 

Tr ( AA* t + BB* t ) is minimized, 

B t A + A t B = 0. 


Combining the first and third equation and using the cyclic property of the trace we get 

Tr (BiBj + B 2 B 2 ) is minimized, with B = B\ + ii? 2 , 

and since B\ and B 2 are real matrices this trace is always nonnegative and therefore the solution is given 
by B = 0. We are free to choose A ^ ~ 5^ and we can deduce that at early times r the full solution is 
given by 

(AM = (3.28) 

for some random phase X 1 . This completely determines the evolution of q £, however the equations of 
motion are not exact solvable therefore we provide an overview of analytical approximations in the next 
chapters. 


Power spectrum 

In the end we would like to compare theory to observations, therefore the main quantities of interest 
are the n-point correlation functions characterizing the distribution of the fluctuations. The lowest order 
statistics are given by two-point correlators. The power spectrum is the dimensionless Fourier transform 
of the two-point correlation function and corresponds to the variance of the distribution of fluctuations of 
a given mode k. The powerspectra V 2J for q £ are defined as 

97 t 2 

<01 ql{-r)qi,(T) |0> = <y(k - k / )-p-Pg J (/c,r), 

such that 

/ u3 u 

Wy). 

This will in particular yield the following powerspectrum for the comoving curvature perturbation 

Vn = (3.29) 

a~cr z H 
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In case of two fields we define the isocurvature perturbation as S = ^q N 
following power spectra 


V S 


H 2 V NN 
a 2 <j 2 q ’ 


and this yields in addition the 

(3.30) 


V TZS = 


H 2 
a 2 a 2 


V, 


TN 


(3.31) 


The power spectra for the curvature and isocurvature modes are evaluated at the end of inflation which is 
equal to their value shortly after Hubble radius crossing if the isocurvature modes decay quickly enough. 
The gravitational waves are characterized by the power spectrum Vt which is expressed in terms of the 
slow-roll parameters the same for every multi-field model, namely 


Vt — 2 Vh 


2 


JV_ 

irMp 


2 

[1 + 2(2 — In 2 — 7 )e*], 


(3.32) 


with 7 Euler’s constant and where the asterisk denotes that the quantities are evaluated at Hubble radius 
crossing. In this expression first order slow-roll corrections ' 5 are taken into account compared to our 
computations in section 2.2.5 and therefore we get the multiplicative factor 1 + 2(2 — In2 — 7 )e*, see 
for example [48]. The power spectra evaluated at Hubble radius crossing depend on the wavenumber k. 
Assuming the powerspectrum can be approximated by a power law, it is convenient to write 

( r \ n a —l+l/2a s lnfc/fc*+... 

kj 

/ U \ nt+l/2at lnfc/fc*+... 


where A s and At are the scalar and tensor amplitudes, n s and nt the scalar and tensor spectral index and 
a s = dn s /dlnk and at = dnt/dlnk the running of the scalar and the tensor spectral indices. The Planck 
collaboration [27] has found 

A s = (2.196 ± 0.060) x 10 -9 for a pivot scale L* = 0.05Mpc _1 , 
n s = 0.9603 ± 0.0073 assuming At = 0, 


but the BICEP collaboration [31] contradicts this latter assumption by the following claimed measurement 
of the tensor-to-scalar power ratio 


r ~ 0.2 


with 



This means one should take into account the degeneracy between At and n s . The scales which can be 
related to the CMB measurements and to future large scale structure surveys [49] run from k = 0.002Mpc ~ 1 
to k = O.lOOMpc -1 corresponding to scales which leave the Hubble radius 60 to 53 e-folds before the end 
of inflation. 


Dictionary perturbation equations 

Let us take stock of the variables introduced to describe the perturbation equations and the quantization 
procedure discussed in sections 3.2.5 and 3.2.6. A translation of all notation introduced in these section 
to the notation used in the literature is given in Table 3.5. Most of the definitions are equivalent, so for 
these the dictionary is a pure translation of symbols. When there is a difference in definition however, 
this is indicated in blue and with an underline. 

5 More precisely, the multiplicative factor reflects corrections to j3 « | and — y ~ aH as stated at the very the end of 
section 2.2.4. 
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Quantity 

Notation 

here 

Notation AP 

Notation TG 

Notation 

GW 

Notation PT 

Field perturbations 

8 (j) a 

5(j) a 

8 <j) 

dpi 

8 (j) 


Metric perturbation 

<h 

*l> 

$ 


T 


Gauge inv. pert. 

Q a 

Q a 

- 

Qi 

Hi 


Mukhanov-Sasaki var. 

q a 

v a 

q 


q 


Com. curv. pert. 

77 

77 

- 

77 

77 


Entropy pert. 

5 

5 

- 

5s = jjS 

5 


Auxiliary functions 

° b 

ria _ 1 ria 

L b ~ 7?^ b 

- 

- 

- 



(M 2 ) a b 

- 

M 2 

- 

M', = 

M a b v a V b 

- V V 2 


n\ 



- 

- 


Vielbeins for any frame 

e 1 

ei 

- 

_ 

_ 


Preferred frame 

T a , N a ,... 

T a , N a ,... 


- 



Coordinates of q 

q 1 

v 1 

q n 

- 

Qn 


Coordinates of Q 

Q 1 

Q 1 

- 

Qi 

Hn 


Coordinates of Q 

Si', 

Si'; 

^mn 

- 

- 


Rotation matrix 

Rj 

Rj 

Rnm 

- 

Unm 


Z-matrix 



Zmn — 

- 

^mn ~ 

= 

Rotated perturbation 

Q 1 

u 1 

q n 

- 

q± 


Fourier modes q 1 

qi 

u 7 (k ,t) 

q n 

- 

Qn 


Solutions to EOM q £ 

0 ql)« 

viik.r) 

Qnm 

- 

Qn 


Power spectra 

T)IJ 

<L 

'pU 

’ V 

- 

_ 

_ 



v\> 

Rq 

- 

- 

- 




Vn 






Table 3.5: This dictionary provides a translation of the notation and definition introduced in section 3.2.5 and 
section 3.2.6 to the notation used in the other papers. This concerns notation used to describe the perturbations 
equations and the quantization procedure. When there is a difference in definition we indicate this in blue and with 
an underline. The abbreviations used in the first row are explained in table 3.1. We leave out CT because in their 
set of papers they only consider two fields. This is discussed in section 3.2.3. 

3.2.7 Perturbation equations in terms of the slow-roll and turn parameters 

Finally we project the perturbations equations along the tangent and normal directions to get more insight 
in the behavior of these equations by rewriting the coefficients in terms of the slow roll and turn parameters. 
From now on we will only consider the case of two fields and leave the case of more fields to future work. 
We will see that ^ couples the curvature mode to the isocurvature mode whenever there is a mismatch 
between the geodesics of field space and the valley of the potential. Therefore rQ parameterizes the multi- 
field effects as expected. In order to keep things clear we choose to work with the kinematical slow roll 
parameters. We will however switch between the time variables, depending on what is most convenient 
for the next section, in which we discuss analytical approximations of the solutions. 

Projection of the perturbation equations with two fields 

We simplify our analysis to two fields and project the perturbation equations (3.17) and (3.18) along the 
tangent T a and normal direction N a , which are defined in section 3.2.2. Therefore we first express the 
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entries of the 17 matrix defined in (3.19) in terms of the slow-roll and turn parameters. We give the four 
components Qtt, ^ tn, &nt and &nn, separately where the index T or N stands for a projection along 
the tangent or normal direction, e.g. £Itn = T a N b £l a b. Because 17 is symmetric we have £Itn = nt and 
the four entries are therefore fully specified by 

Htt = -a 2 H 2 (2 - e + t}h (—3 + e + i]h + £h) ~ (v ± ) 2 ^j , 
n NT = a 2 H 2 7j ± (3 - e — 2 rj H - ^ , 

^nn = — a 2 H 2 (2 — e — m 2 ) , 

with m 2 = Mjr N /H 2 . Moreover, if we wish to express the perturbation equations in the preferred basis 
{T a , N a }, we can see from equation (3.20) that we also should express the Z matrix in terms of the slow-roll 
and turn parameters. The Z matrix, which is defined in equation (3.21), is anti-symmetric and therefore 
in case of two fields it is given by Ztn = —Znt = aHrj 2 -, if we pick the vielbeins {e^} to be { T a , N a }. 

Now projecting the equations for the generalized Mukhanov-Sasaki variables (3.18) as function of conformal 
time along the tangent and normal direction, we get the following system of coupled differential equations 

q J " + 2i] ± aHq N ' + ( k 2 + a 2 H 2 (-2 + e + (3 - e)r)H + v’h ~ Vh)) ^ + i^a 2 !! 2 ^4 - 2e - 2t)h - 2£ ± ) q N = 0, 
q N " - 2r] ± aHq T ' + (k 2 + cl 2 H 2 (m 2 - 2 + e - (?? X ) 2 )) q N + i^^H 2 (2 - 2r) H ) q T = 0. 

(3.33) 

These are the equations describing the evolution of the perturbations which are canonically quantized 
and therefore useful to study during the sub-Hubble regime in section 3.3.1 where the initial conditions 
are imposed. We see that the differential equations couple to each other as soon as r] 2 - / 0, i.e. if the 
inflationary trajectory is curved with respect to the geodesics in field space. If there are no turns, then 
the equations decouple and the equation for q T becomes equal to the canonical single field version. As 
expected, the turn parameter rj 2 - carries the most important information about the multi-field effects and 
therefore should be considered separately from the slow-roll parameters. The variables we are ultimately 
interested in to compare with experiment are the curvature and isocurvature perturbation 1Z and S. We 
can rewrite the full equations of motion for the perturbations (3.35) in terms of 1Z = ^4- and S = 4— and 
get 

(Jv +3 ~ €H ~ 2m ) ( 7z; + 2ri±s ) + Jip' R = 0 ’ . . 

/ j~2 M 2 \ (3.34) 

cS ;: + (3 - e H ~ 2 r) H )S' + ( ^2 + Jp ~ + Vh(.Vh - 3 + e H ) - rf H \ S = 27 1 ± 'R\ 

Remember that the curvature perturbation 1Z corresponds to fluctuations in the fields along the infla¬ 
tionary trajectory and represents the spatial curvature perturbation in the uniform density gauge. The 
isocurvature perturbation S keeps the spatial curvature perturbation and total energy density of the fields 
constant, but mixes up the relative ratios of the energy densities of the fields. If the isocurvature pertur¬ 
bations decay on super-Hubble scales, then the curvature perturbation will freeze out [29]. This means 
the value of the mode functions of 1Z evaluated shortly after Hubble radius crossing will stay constant 
until the mode enters the Hubble radius again in an epoch of known physics and therefore we can relate 
the predicted distribution of the curvature modes to the observed temperature fluctuations in the CMB. 

The same remarks on the perturbation equations (3.33) for q a also apply here. Concerning the signs of 
coefficients, whether the turn is to the right or to the left it should not change the way the curvature and 
isocurvature modes are affected, because it is just a matter of convention. Note that we do not need to 
take absolute values of i] 2 - for the source terms of the differential equations, because each projection on N a 
also contains the same left/right information given by SN(t). All the terms in the curvature perturbation 
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equation therefore contain an even number of ?y^’s and N a, s whereas all the terms in the isocurvature 
perturbation equation contain an odd number. The equations (3.34) will be extensively used to discuss 
the analytical approximations of the solution of the perturbation equations in the super-Hubble regime in 
section 3.3.2 and for one of the approximation schemes in the transition regime in 3.3.3. Besides equations 
(3.34), sometimes the equations for the variables Q a (3.17) as function of number of e-folds are used to 
describe the super-Hubble evolution 


Q T ’’ + (3 
Q Nv ’ + (3 


e)Q T ' + 


( k 2 

\a 2 H' 2 


+ (3 - e)r] H + rf H - riff ) Q 1 = -2^ Q JV; + (3 - q H -e- Q 




t)Q N ' + + 77)2 _ [Q T; + vhQ t ] 


(3.35) 


We will only use these equations to study the ‘slow-roll approximation on perturbations’ in section 3.3.2. 
Finally, during transition the equations can be simplified when we assume for example the first order 
SRST approximation. Using use the expressions (3.16) we get 


q T "+ k z - 


q N "+ k z + 


2 + 3e 


l + 2e 


q = 2r/ J 


[m 


1 + e q Nl 

T 
N 


1 + 2e 


(2 ~e)q 


,N 


- 2 + e) )q N = —2rj 


1 “h 6 rp. 1 -|- 26 rp 

-9 +-o ^ 

'T 'T^‘ 


(3.36) 


where we kept everything to linear order in e and rj 1 ~. This is useful for one of the approximation schemes 
to solve the perturbation equations in the transition regime studied in section 3.3. 


Dictionary perturbation equations with two-fields 

Some of the papers we include in this overview consider the particular case of two fields. In Table 3.6 we 
provide a dictionary to translate the variables introduced in section 3.2.7 to the notation used in these 
particular papers. When there is a difference in definition of the variables we indicate this in blue and 
with an underline. 
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Quantity 

Notation 

here 

AP 

GW 

CT 

Two fields 

4>,x 

A A 2 

< i>,x 

xA 


Gauge inv. pert. 

Q T iQ N 

Q T ,Q N 

Q cr ? 

Qai (^5 


Mukhanov-Sasaki var. 

q t ,q n 

T N 

V , V 

- 

'U'CT 5 U s 


Com. curv. pert. 

n 

n 

n 

n 


Isocurvature pert. 

s 

s 

5s = jjS 

s 


Auxiliary functions 

M 2 

M 2 

As = M 2 + 3jH -2 ?? ±2 

- 



O.j'T 


- 

M u = 

_ fl TT , r, 

“ HAW 1 + 1 


Qtn 

VLtn 

- 

M 12 = 

_ TN 

“ aPTP 


Qnn 

HiViV 

- 

-Ad 22 = 

_ £Inn i o 

“ “T" Z 


Ctt 



Caa = 

Ctt—^tn 




a 2 


Ctn 

- 

- 

C aS = 

Ct n “I - a n H - 2 a H Z'j' n 

a 2 





c S(T = 

Ctn+uZtn — 2clHZtn 

a 2 


Cnn 



c ss = 

Cnn Ztn 




a 2 


Vtt 

Vu 


Vaa 

v TT +T a T b r c ab v c 


3 H 2 


Vtn 

V(f)N 


Vas = 

V TN +T a N b r c ab V c 



3 H 2 


Vnn 

Vnn 


Vss = 

V NN +N a NT° h V c 


3IP 


Z T n 

Z T n 

- 

- 


Power spectra 

Vn 

Vn 

VQ a 

Vn 



V s 

V s 

V Ss =? 

V s 



V-jzs 

Vns 

CQ a Ss =? 

~ 



Table 3.6: This dictionary provides a translation of the notation used to describe the perturbations equations in 
case of two fields. When there is a difference in definition we indicate this in blue and with an underline. The 
abbreviations used in the first row are explained in Table 3.1. This dictionary complements the dictionary given in 
Table 3.5. 


3.3 Analytical approximations 

Now we have introduced all the relevant notions, in particular the important turn parameter which pa¬ 
rameterizes the multi-field effects, we continue studying the various approximation schemes to solve the 
perturbation equations in case of two fields. In this section we mainly work with the time variable number 
of e-folds of inflation N and the dimensionless positive time variable z = —hr. We denote a derivative 
with respect to N as semi colon and a derivative with respect to z exactly like we denoted the derivative 
with respect to r, with a prime. As explained at the end of section 3.2.5 the evolution of the modes of the 
perturbations can be divided into three regimes, sub-Hubble a <C jj, transition a ~ jj and super-Hubble 
a> j. We will denote the beginning and the end of transition as and z +, or iV_ and N + , respectively. 
A quantity A evaluated at Hubble radius crossing is denoted by A*. We fix the value of z at Hubble 
radius crossing at z* ~ 1 as explained in section 3.2.3. Finally we will be sloppy with jargon and entitle 
all variables q T , Q T , 7Z as the curvature or adiabatic perturbation and all variables q N , Q N , S as the 
isocurvature or entropy perturbation. 

It turns out that only two types of approximations are made in the papers under consideration. The 
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approximation schemes can roughly be summarized as follows 

The slow-roll approximation and either a mass hierarchy m 2 3> 1 or the slow-turn approximation. 

In all papers the slow-roll approximation is assumed to be true from deep inside the sub-Hubble regime 
until far beyond the transition regime in the super-Hubble regime. One part of the literature (CT, AP, 
see Table 3.1) assumes a large mass hierarchy and an effective single field description is derived. The 
other part of the literature (GW, TG and PT) assumes a slow-turn approximation and the perturbation 
equations are solved by treating the slow-roll and turn parameters on equal footing. We study the regime 
of validity of the two approximation schemes by rederiving their analytical solutions of the perturbation 
equations. Many details on the computations can be found in Appendix A. We will first discuss the 
early and late time evolution of the perturbations and then solve the equations in the transition regime 
in the two approximation schemes. Finally in section 3.3.4 the approximations and their validity and the 
predictions are summarized in Table 3.7. 


3.3.1 Sub-Hubble 

In the sub-Hubble regime we have k 2 3> a?H 2 or equivalently z 2 3> 1 and the perturbation equations 
(3.33) reduce to 6 

4 "+ 4 = o, 

4 " + Qa = 0 , 


where the solution is determined by assuming the initial state of the universe is the Bunch-Davies vacuum 


like we found in section 3.2.6. 



5 


i 


5 


2 

ex') 


This is not the complete story however, because if there exists a mass hierarchy m 2 3> 1, the frequency 
of the isocurvature modes changes before the beginning of the transition region. This happens at the 
scale k 2 /a 2 H 2 ~ m 2 . In this case the sub-Hubble regime should be divided into two regimes, one where 
k dominates all other scales in the perturbation equations k 2 /a 2 H 2 3> m 2 3> 1 and where we can impose 
the initial conditions, the other where the isocurvature mass scale dominates m 2 3> k 2 /a 2 H 2 2> 1. In the 
second part of the sub-Hubble regime we expect the following behavior of the solution of the perturbations 
equations 

qZ = (A-) a e i f dtu -+(A + ) a e i f dtu +, 

q% = (S_) Q e i J' dtw - + (B + ) a e*f dt “+, 

where t c 2 _ ~ k 2 /a 2 , uj\ « H 2 m 2 + k 2 /a 2 and (A + ) a = (B-) a = 0 when there are no turns. In case of a 
constant turn rate rj 2 - we find that the high frequency dissociates even more from the low frequency such 
that in any case uj! 3> ui 2 _ 3> H 2 . The solution becomes approximately 


4 ~ ^ + \h 2 (m 2 + 3(4) 2 ) (1 ± 1) ± 4 

—2irj L Hu: + 


2/V.-U2 


k 2 (r] 


a 2 (in 2 + 3 (rj 2 -) 2 ) ’ 


4+) 

(B-) 


2 7 9 / 9r(-6+)a> 

— k z / a z 

H 2 (m 2 — (r/ 2 -) 2 ) + k 2 /a 2 — w 2 _ 


(3.37) 


4-)c 


6 Remember we are working with the variable z 


— hr and a prime denotes a derivative with respect to 2 in this section. 
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Using the quantization conditions (3.26) we can estimate (A_)„ ~ ~^= = 5 p and ( B + ) a ~ -A=S^, 
which means that the high frequency modes are relatively suppressed. Since the difference between the 
frequencies becomes larger in case of a constant turn rate we expect that these conclusions also apply for 
the more general case of slow roll inflation and any turn rate which does not vary too quickly in time. 
In the next section we will show in addition that the high frequency modes will decay on super-Hubble 
scales. This means we can neglect the high frequency solutions as soon as the mode enters the part of the 
sub-Hubble regime where m 2 3> k 2 /a 2 H 2 which allows us to compute an effective theory in section 3.3.3. 
More calculational details can be found in Appendix A. 


3.3.2 Super-Hubble 

We now study the behavior of the perturbation equations in the super-Hubble regime where we can neglect 
k 2 farH 2 or equivalently take z<l. We start with a simplified case where we assume that the zeroth 
order slow roll approximation is valid and that both m 2 and ij 2 - are constant. Therefore we can use the 
perturbation equations (3.36) where we put e = k = 0 and change variables to z 


<£' - =v (),»- !<£), 

+ b (m 2 - 2 - (n 1 ) 2 ) •£ = -2ij x ( b'.' + bt'). 


Using the ansatz q J a = A : a z p , where A £ are constants and P some power, we find four solutions to the 
perturbation equations. The full solution is given by 


q T a = 8l (Az- 1 + Bz 2 ) + ^C 4,? * '_ 

V ' 3 + ^9 — 4(m 2 + 3(??- L ) 2 ) 


X z h (i+^9-4(3 (v ± ) 2 +m 2 )) ^±^.5 ( 1 -\/ 9 - 4 ( 3 ( , ?" L ) 2 + m2 )) 


q a 4 [ B ^±y_ rn 2 Z 


“ 3 + v / 9-4(m 2 + 3(??- L ) 2 ) “ 3 - ^9 - 4(m 2 + 3(r/-L) 2 ) ’ 

2^ +() -A r ^(J z h (i+a/9-4(3 (,P) 2 +m 2 )) + Dz l ( X " \ , 


to zeroth order in slow roll. Note that the constants C and D are determined by the initial conditions 
for q N and therefore the combinations rj 2 - C and r] P D have a sign independent of the convention for sn 
at the beginning of inflation. From this example we read off the general behavior of the perturbations at 
super-Hubble scales. We have a growing mode q T ~ z^ 1 and a decaying mode q 1 ~ z 2 whose frequencies 
goes to zero. If there is no mass hierarchy m 2 <C 1 and a very small turn rate q 2 ~ <C 1, then we have 
another pair of growing and decaying modes q N ~ z -1 and q N ~ z 2 , which means the isocurvature modes 
will not decay and for non vanishing rj 2 - they will keep sourcing the curvature perturbations. If there 
is a large mass hierarchy m 2 3> 1 then the other modes are decaying q 1 ~ z 1 ^ 2 with a high frequency 
determined by the effective mass squared M 2 ff = (m 2 + 3(r/- L ) 2 )FI 2 . In this case we find that after Hubble 
radius crossing the curvature mode freezes out, TZ ~ z°, while the isocurvature modes oscillate rapidly 
and decay, S ~ z 3 / 2 . In Appendix A we derive the full solution to first order in slow roll and we find the 
same behavior. 


Next we study the more general super-Hubble solution. We use the full equations of motion (3.34) for 
the curvature and isocurvature perturbations 1Z and S as function of number of e-folds. These equations 
yield the following exact solution in the super-Hubble regime 

W = - 2 ^ 5 , 

where the exponential corresponds to the decaying solution 7 Z ^ a~ 2 q 2 ~ a -3 we found above and if there 
are no turns the isocurvature modes stop sourcing 7 Z and it will become constant. Assuming we pick N + 
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large enough, such that we can neglect the decaying part, the system reduces to 


W = —2r] ± S, 

/ o i 9 . \ (3.38) 

«S” + (3-e- 2rj H )S’ + (m 2 + 3(?y L ) 2 + ij h (vh - 3 + e) - r]’ H J S = 0 . 

We will approximate the solutions in case of a mass hierarchy m 2 2> 1 and in case of the SRST approxi¬ 
mation. 


Case 1: a mass hierarchy m 2 3> 1 

In first order slow roll and when S is heavy enough we expect S ~ a -3 / 2 in the super-Hubble regime 
because S ~ q N /a and we found two decaying solutions q N ~ a -1 / 2 in case of constant turns and constant 
mass. This means all the isocurvature perturbations decay and we have a freeze out of the curvature 
perturbation on super-Hubble scales. Assuming m 2 3> 1 and approximately constant we can derive the 
conditions on rj ± such that this expectation is true. Rewriting the equation as 

cS” + /iS ; + f 2 S = 0, 

S” = hS, with S = exp J(fi/2)dN S and / 3 = ^/, 2 + ^/j - / 2 , 


we can use the WKB approximation to find 


S = A exp 



1 fj | 1 (fj 

4/3 8Vh\f3 



+ Bexp /^(-vT-4 


1/i . 1 (fj 

4/3 8 



where the functions f± and / 3 expressed in the slow-roll and turn parameters take the form 


fi = 3 - e - 2 i] H , 

/3 = | ( 9 - 4(m 2 + 3(?? ± ) 2 ) - 6e + e 2 + 2e?/ ff ) • 

Only if / 3 is varying rapidly this might lead to a non-decaying solution for 5, therefore we can derive the 
following rough constraint on ^ _L to assure that the isocurvature modes will decay on super-Hubble scales 

< (3 - e - 2 r] H ) ^1 + assuming m 2 > ^ - 3(r/ ± ) 2 > |. (3.39) 

Note that the decaying solutions are also oscillating as e = t *\/ m “+ 3 ( , ? ± )“^ 9 / 4Ar while the frequency of the 
curvature mode goes to zero. As remarked in section 3.3.1 the frequency of the isocurvature mode changes 
after a time z < m. The curvature and isocurvature modes are described by a system of coupled harmonic 
oscillators, but do not ‘synchronize’ because they are coupled by a derivative interaction. Instead they will 
tend to oscillate with a low frequency and a high frequency which dissociate even more if the coupling is 
turned on. Based on the analysis in this section and in section 3.3.1 we expect the high frequency modes 
to be highly suppressed with respect to the low frequency modes already after a time z < m and therefore 
they do not contribute to the final amplitude of TZ. Therefore one only needs to take into account the low 
frequency mode for which the time derivatives of the isocurvature perturbations can be neglected, because 
they correspond to the slowly varying oscillations with a frequency much smaller than m. In section 3.3.3 
we will integrate out the heavy mode which basically comes down to neglecting the high frequency solutions 
and approximating the low frequency solution using the mass hierarchy. 


— < /I 

2/3 J 
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Case 2: the SRST approximation 

In the studies (TG, PT) the slow roll approximation on perturbations is made, which comes down to 
neglecting Q T;; and Q N ;; in the perturbation equations. It is based on the observation that on super- 
Hubble scales the background fields <p a satisfy the same equations of motion as the perturbed fields cf) a +5(j) a . 
Assuming both the slow-roll and the slow-turn approximation we can neglect Djsi<j) a '\ This means we can 
use the same approximation on the perturbed fields and therefore the difference, the equation for Q a given 
in (3.17), does not contain a double derivative. This means the perturbation equations (3.35) for two fields 
become 

(3 - e)Q T; + ((3 - e)rj H + rf’ H - rfe) Q 1 = -2rj L Q N ' + (3 - e - r) H - t, ± )Q N , 

(3 - e)Q N ' + (m 2 - (ry 1 ) 2 ) Q N = 2 r] 1 [< Q r ' + tjhQ 1 } . 

Transforming back to the the variables 77 and S we get 

(3 - e)77 : + (rj' H - if H ) 77 = -2 1 ] 1 - S' + (3 - e - 2r) H - ^ )S , 

(3 - e)S : + (m 2 - (p L ) 2 - (3 - e)r]H^ <S = 2??- L 77 : . 

The equation for 77 does not simplify so we only use the second equation for S and stay with 77 ; = —2 rj-^S 
which yields 

77 ; = -2?? ± cS+ exp ^ (i + |) (m 2 + 3(?y L ) 2 - (3 - dN , 

where 5+ times the exponential is the solution for S. Therefore at super-Hubble scales we have the 
following expression for the curvature perturbations 

f N ( " 1 1 

77 = 77+ — S + J < 2rp L exp J - (l + |) (m 2 + Z{r] L ) 2 - (3 - e)rj H S ) dN | dN. (3.40) 

Note that the decaying solution S ~ e“ 3iV is automatically discarded. If there is no large mass hierarchy 
m 2 <C 1, but a nonzero (small) turn rate, the curvature mode does not freeze out completely and its 
evolution needs to be tracked until the end of inflation. If there is a small mass hierarchy m 2 ~ 1 then it 
is expected that the curvature mode will freeze out quickly after Hubble radius crossing. 

3.3.3 Transition 

Next we need to solve the perturbation equations in the transition regime in order to match the sub-Hubble 
initial conditions to the super-Hubble solution. In case of a mass hierarchy between the curvature and 
isocurvature mode we can compute an effective single field description from the moment that z < m. If 
we assume the SRST approximation we can try to solve the full differential equation in this regime. 


Case 1: effective single field description in case of a large mass hierarchy 


As explained in section 3.3.2 we integrate out the heavy modes when z < m. The low frequency solution 
allows us to neglect the time derivatives of S resulting in an effective equation for 77 


77” + 3 — e — 2 pH — 2- 


r .2 ;,2 


(3.41) 


This equation has a variable speed of sound c s and therefore differs from the equations which follow from 
a canonical single field inflationary theory. The speed of sound c s in terms of the background quantities 
is given by 

1 4(??- l ) 2 

- 1 I V / / /O 


“^7 = 1 + 


+ m 2 - (r/ 2 -) 2 + r} H (rj H - 3 + e) - rf H 


(3.42) 
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This allows us to derive the effective action for the perturbations up to quadratic order where we can fix 
the overall normalization by putting c s = 1 and using the canonical single field result discussed in chapter 
?? 


Seff = M 2 p / dNd 3 x 


a 3 eH 


m 2 - 


a 2 H 2 


(diKf 


(3.43) 


The calculational details to find the effective action (3.43) can be found in Appendix A. 


Next we discuss the validity of the effective single field description. This effective single field description 
only holds for times z < m which means the number of e-folds before Hubble radius crossing at z = m 
equals AN = lnm. If you want to impose initial conditions on 7 Z deduced from the sub-Hubble solution 
you need therefore 

AN > 2.3 ~^m> 10. (3.44) 

Moreover, we can express the validity of the effective theory in terms of adiabatic conditions on the turn 
rate and the speed of sound following [43]. The derivations can be found in Appendix A. We find the 
following constraints on the speed of sound 


W 


u 


< min 



m 2 — (V - ) 2 \ 

\2k/aH — 2| J ’ 


< 


m 2 — {rj L ) 2 


(3.45) 


with u = 1 — cj 2 and independent on the fact whether u is small or not. When u <C 1 it is also 
possible to express these adiabatic conditions in terms of the turn parameter and its derivatives and we 
get approximately 


V 


<C \J\m 2 - (r/- 1 ) 2 ! 


-L\2i 


£ <C \m - (?y ) 


(3.46) 


These conditions ensure that the timescale of the duration of the turn is much bigger than the period 
of the oscillations about the flat direction of the potential and therefore we can safely neglect the high 
frequency oscillations. 


If the adiabatic conditions are satisfied it is possible to solve the effective single field equation (3.41) for 
1Z. We consider two cases, one for which we have a constant turn rate i] 2 - for all times and another in 
which we have a transient turn for which c s is assumed to go to 1 for the limits r —>• —oo and r —> 0. 
For the case of a constant turn rate, the initial conditions for the low frequency solution 7Z change in the 
second part of the sub-Hubble regime z < m as explained in section 3.3.1. Moreover, when c s < 1 the 
modes will cross the Hubble radius earlier because the sound horizon decreases with c s which can be seen 
in equation (3.41). Taking both effects into account, we find the following power spectrum 


Vtz 


H 2 


8e*7 x 2 c s Mp 


—'Pno, 

c s 


(3.47) 


which is about p- times the powerspectrum Vtio one would achieve when naively computing the truncated 
single field version. Actually one should also take into account the change of and e*, because if c s < 1 
the modes will cross the Hubble radius earlier which means that H * becomes bigger and e* smaller com¬ 
pared to the value one gets in the truncated single field version. This means the amplitude of the power 
spectrum will be enlarged even more. This effect is negligible when c s is close to 1 and this approximation 
is therefore valid for a large mass hierarchy in case of a constant turn rate satisfying (?y*-) 2 <C m 2 . For 
values of c s which are smaller we expect to have another small correction on top of this. In section ?? we 
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will discuss how the observables change. 


Assuming a transient turn rate and the adiabatic conditions (3.45) we can interpret the contribution of 
the change in the speed of sound to the effective quadratic action for perturbations (3.43) as a small 
perturbation 




m 2 - 


a 2 H 2 


(<W 2 


M% J dNd 3 x a 3 eHu(K’) 2 


with u = 1 — c s 2 . Using the in-in formalism, see for example [50], the change in the power spectrum to 
first order in u is found [51] to be 


AVtz 

'Ptz 


(k) = k 


dr u(t) sin(2 hr) 


(3.48) 


This means in case of a transient turn we get a slight scale-dependent power spectrum as in canonical 
single field inflation but with features on top of that. 


Case 2: full solution to the perturbation equations using the SRST approximation 


We now solve the perturbation equations in the transition regime while assuming the first order slow-roll 
and slow-turn approximation, see Table 3.2. We can diagonalize the perturbation equations (3.36) which 
become in this approximation 


fD 2 + k 2 + a 2 H 2 (-2 + e + m 2 _) 0 \ (u T 

y 0 D 2 + k 2 + a 2 H 2 (—2 + e + m 2 + ) J 

by rotating the fields to the new fields 

<*'M ^ (W)V(t), 

where the rotation matrix is given by 

/cos# — sin (A _ V~( 3 ~ g ) 

1 ^sin# cos 0 J 1 ( r? ±) 2 _ m 2 ‘ 


(3.49) 


In Appendix A the derivation of equation (3.49) can be found. In the studies TG and PT this equation 
is solved by Hankel functions, which are solutions to the Bessel equation. However, this equation is only 
equivalent to the Bessel equation if D 2 = d 2 , which is true if there are no turns during the transition 
period. Therefore in general we actually should start from equation (3.22) and diagonalize it such that we 
can solve it with Hankel functions. It turns out that the turn rate must be very small in order to make 
this identification. In Appendix A we derive the following constraints on the slow-roll and turn parameters 
which are necessary to solve the perturbation equations in the transition regime by Hankel functions 


e, « const and | r] ± \ < 0.04. 


(3.50) 


We assumed N± = TV* dL 2.3 as the beginning and the end of the transition regime. If these conditions are 
fulfilled the solutions of the equations of motion for u 1 are given by a linear combination of the Hankel 
functions of the first and second kind 


< = V~zH\\z)Ai + V~zHi\z)B I a , 


(3.51) 


where the constants v 1 are given by 


. T . 9 1 2 — e* - m 2 _ 9 (n 

(u T ) 2 = - + —-— « - + 3e* + 9 W 


-A2 


(1 — e*) 2 4 

1 2 — e* — ml 9 


m. 


{uN)2 = X + „ 1 + 3e , - (1 + 2e*)m 2 — 9 


(v 


±\2 


mz 
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Here we assumed first order slow roll and m 2 3> ( 77 ^ ) 2 , where the latter assumption is motivated by the 
discussion in section 3.2.3, although we would rather expect from this discussion that m 2 > ( r%) 2 ■ We 
come back to this later. The constants A J Q and B / in equation (3.51) can be found by matching the 
solution to the initial conditions for q ^ when z = Z- i$> 1. It turns out that the coefficients A ^ in front 
of the Hankel function of the first kind become zero. To find the solution at the end of transition we can 
use the asymptotic behavior of the Hankel function of the second kind in the limit z <C 1. We neglect the 
decaying solution z v +1 / 2 and transforming back to q we get at the end of transition 

Qa( z +) = (cos 2 65% + cos 6 sin 65%) f(v T ) (J^j 

1/2 

HjT ( / 2 x^-1/2 v n_ 1/2 

Qa ( z +) = \j Tk (cos 2 95% - cos 6 sin 65%) i f{v N ) J + g(v N ) J 
+ 

Here 6 is dehned at the beginning of this subsection and the functions / and g are introduced to shorten 
the expressions and given by 

ii/ 1 tt/ 2—iX 1 +iir/4 ^\%2T'( y iy ) 

7r 

_ -idW2-iATk/4V2(l -icot(nu N )) 

9{ ’ r(i + u N ) 

Now there are multiple possibilities. If m 2 -C 1 then v N ~ 3/2 and we can neglect the decaying solution 
z v +1 / 2 as well, but the first term is obviously a growing solution and this might produce a large spectrum 
for the isocurvature modes. If m 2 > | then the real part of u N becomes zero and both terms z± u +1 / 2 go 
like z 1 ! 2 and these modes decay like massive modes. We consider both cases separately. 



cos 6 sin 65% + sin 2 95% ) f(y J 


(0 


"- 1/2 



- (cos 6 sin 05% - sin 2 65%) ^f(v N ) (^- 


2 \ iyJV_1 / 2 / 9 

) 


If m 2 > | > r/- 1 we can approximate cos 6 = 1, sin 6 = n %% ^ and u 1 = | + e* we hnd that the curvature 
perturbation is completely frozen out at z+ such that we can derive the following power spectra at the 
end of inflation 


V-r. 


V s 


8e*7r 2 Mj> 

ii; 


8e*7r 2 M|, 


(1 + 2{C — l)e*) 
(1 + 2 (C — l)e*) 


1 + 


9 (n, 


X\2 


rri* 


9 (?/; 


.-L\2 


Tfl 


(3.52) 


In this limit, the power spectrum of the isocurvature modes can be neglected and the power spectrum 
of the curvature modes are indistinguishable from canonical single field inflation because the turn rate is 
extremely small compared to m. 


If m 2 <C 1 we can approximate v 1 = v N = §+e* and we find both curvature and isocurvature perturbations 
which evolve independently. Therefore we both get a spectrum of curvature modes and isocurvature modes 
at the end of the transition regime 


V-r. 


V s 


Hi 


8e*7r 2 Mp 

Hi 


8e*7r 2 Mj, 


(1 + 2(C — l)e*), 
(1 + 2 (C — l)e*). 


50 



What happens after z_|_ depends on what value the turn rate will take in the super-Hubble regime and 
on the physics after inflation. This makes it very hard to make any predictions of observables. The 
fact that the curvature modes and isocurvature modes evolve independently comes from the assumption 
m 2 3> (??"*") 2 , which means in this case that we have put r/ 2 - to zero during transition. This assumption 
should be weakened to m 2 > ( i]^) 2 and we expect that the biggest effects of the turn appear when 
m 2 > (j/ 2 -) 2 . However, since we still expect to get a sizable spectrum of isocurvature modes which we do 
not know how to relate to experiment, we do not perform the calculation. 

3.3.4 Overview 

We summarize the approximation schemes and their regime of validity in Table 3.7. In the papers we 
have been studying, two different approximation schemes are being used. In the studies TG, PT and GW 
the perturbations equations are solved by assuming both the slow-roll approximation and the slow-turn 
approximation (SRST approximation). In the studies CT and AP the slow-roll approximation and a large 
mass hierarchy is assumed. We found that the computations performed in the corresponding studies are 
valid if the conditions in the second column of the Table are satisfied. Moreover, we computed the power 
spectrum of the curvature mode at the end of inflation which are shown in the third column. 

Let us discuss each study of multi-field inflation separately to see how they fit in this overview. Pioneering 
studies of curvature perturbations and isocurvature perturbations in multiple field inflation were performed 
by GW and TG [37, 52], In these papers the general formalism is developed to describe the evolution 
of both types of perturbations. The importance of r? 2 - as carrier of multi-field effects is recognized as it 
turns on the coupling between the curvature and isocurvature perturbations. However, only super-Hubble 
effects are taken into account and moreover, the turn parameter is treated as a slow-roll parameter. A few 
years later it was realized that the role of the turns can be more important and lead to observable features 
which can be tested against the data. In the studies of AP, CT and PT [39, 41, 44] the effects of turns 
during Hubble radius crossing are taken in account and analytical expressions for the power spectra at the 
end of inflation are derived. In the extensive treatment provided by PT [44] the slow-turn approximation 
is introduced and the equations are solved to second order in the slow-roll and turn parameters. One of 
their conclusions is, however, that a large turn rate results in a decay of the isocurvature modes. This can 
be understood since their approximation scheme is actually only valid in case of negligible turns during 
the transition regime and during the super-Hubble regime the isocurvature modes will indeed decay if the 
turn rate is large for a sufficient period of time according to equation (3.40). However, it is not so generic 
that the turn rate is negligible until the end of the transition regime whereafter it becomes large. If the 
turn rate became nonzero during transition, then we have seen that we get different effects. In the papers 
[39, 42, 53, 43] the slow-roll approximation and a large mass hierarchy is assumed which results in an 
effective single field description with a reduced speed of sound. The results of CT apply for constant turn 
rates and the results of AP are more general and in particular valid for transient turn rates, but constant 
turn rates can also described within their formalism. 

Finally, we will elaborate a bit more about the predictions of observables within the two approximation 
schemes. In case of the SRST approximation we have seen that we either get the same predictions for 
canonical single field inflation or we don’t know how to compute the power spectrum at the end of inflation 
and after inflation because of the presence of isocurvature modes. Considering that non-Gaussianities of 
the local form and isocurvature modes are ruled out by observations we expect that the second case is 
ruled out by the data. But of course one can always argue that the turn rate can stay very small for all 
times and the isocurvature modes might decay after inflation. In the case of the slow-roll approximation 
and a large mass hierarchy we have studied the constant turn rj = const and the transient turn for which 
rj 1 - goes to zero at early and late times. For a constant turn we have found a rescaling of the power 
spectrum of scalar perturbations. Since the power spectrum for tensor modes is unchanged this means we 
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get the following values for the observables 

n s - 1 ss -2e + r] H , n t « — 2e, r = 16e*c s . 

This means a constant turn changes the consistency relation between r and nt- Moreover, another obser¬ 
vational consequence is that the bispectrum gets renormalized 

Bn = Bno, 

which means that the relative amplitude of the power spectrum and bispectrum changes with respect to 
canonical single field inflation 7 . Arguably, the largest observable effects are created by a transient turn 
rate during or very close to the transition regime. The change in the power spectrum is given by equation 
(3.48) which tells us that a transient turn will result in oscillations in the power spectrum. If the width of 
the oscillation is small compared to the size of the observable power spectrum, then the power spectrum 
can be described by a canonical single field power spectrum with oscillations. In this case the amplitude, 
spectral indices and tensor-to-scalar ratio are unchanged, but new parameters are needed to describe the 
oscillations. If the width of the oscillation is large compared to the size of the observable power spectrum, 
then this might change the scalar spectral index and the amplitude. Moreover, it has been shown [51] 
that the bispectrum will have correlated oscillations and therefore a joint analysis of both spectra could 
reveal the signatures. 


7 Canonical single field inflation is defined as single field slow-roll inflation with canonical kinetic terms, minimally coupled 
to gravity and with Bunch-Davies initial conditions. 
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Approximation scheme 

The Slow-Roll-Slow- 
Turn (SRST) approxi¬ 
mation 

Used in: 

TG, PT and GW 
[37, 52, 44] 


Regime of validity 

During transition: 

e,rj L ss const and |ry L | < 0.04 


Predicted power spectrum 


If m 2 > | then: 


'Pn 


Hi 


8e*7 r 2 Mp 


1 + 


9('h 


M2 


nr 


In other cases it is unknown be¬ 
cause of the presence of isocurvature 
modes. 


The slow-roll approx¬ 
imation and a large 
mass hierarchy m 2 3> 1 

Used in: 

CT and AP [39, 41] 


One needs m > 10 and the adiabatic con¬ 
ditions: 


In the case of a constant turn rate: 


i] 


V 


<C \J\m 2 - (t/- 1 ) 2 ! 


Ptz = 


Hi 


8e*ir 2 c s Mp c s 


—Vtzo- 


<C |m 2 - ( rj ± ) 2 \. 

Or equivalently: 


< min 


\m 


- {r] ± ) 2 \ |m 2 -(r/- L ) 21 


In the case of a transient turn which 
vanishes at early and late times: 

( k ) = k [ dr u(t) sin(2/cr). 

'7 Z J —oo 


’ \2k/aH — 2] 


w 


< 


±x2 


m 2 — ( r] ) 


Table 3.7: We summarize the approximation schemes and their regime of validity. In the papers we include in 
our overview two different approximation schemes are being used, these are given in the first row. In the studies 
TG, PT and GW (see Table 3.1) the perturbations equations are solved by assuming the SRST approximation (see 
section 3.2.3). In the studies CT and AP the slow-roll approximation and a large mass hierarchy is assumed. In 
the second column we give the conditions to render the computations performed in the corresponding papers valid. 
In the third column we give the analytical predictions of the power spectrum of the curvature mode at the end of 
inflation. Here m 2 = M^ N /H 2 as defined in section 3.2.7. 
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3.4 Conclusion and Discussion 


In this chapter we have studied different papers of multiple field inflation in literature given in Table 3.1. 
The notation and definitions used in these studies differ and therefore we provide dictionaries to translate 
between the papers. These dictionaries can be found in Table 3.3, 3.4, 3.5, and 3.6. In particular the 
definition of the slow-roll and the turn parameters vary among the papers. We define a turn as a deviation 
from a geodesic of the inflaton in field space, which will couple the inflaton to the other degrees of freedom. 
This means the turn parameter is very important to describe the multi-field effects and should be consid¬ 
ered separately from the slow-roll parameters. However, some papers mix up the slow-roll parameters and 
the turn parameters as can be seen in Table 3.3. 

Moreover, we studied the approximation schemes used in the papers which are used to solve the perturba¬ 
tion equations. We find that only two different approximation schemes are used and we study their regime 
of validity and the analytical predictions of the power spectrum of the curvature mode. The overview of 
the approximation schemes can be found in Table 3.7. In the studies TG, PT and GW the perturbations 
equations are solved by assuming the Slow-Roll-Slow-Turn (SRST) approximation (see section 3.2.3). In 
the studies CT and AP the slow-roll approximation and a large mass hierarchy is assumed. The first 
approximation scheme turns out to be valid if the turn rate is negligible during the transition regime. 
This results in a power spectrum for the curvature mode which is either indistinguishable from single field 
inflation or unknown at the end of inflation due to the presence of the isocurvature modes. The second 
case is most likely ruled out by the current experimental data. This means that this approximation scheme 
is very limited in the sense that it can only be used to describe multi-field inflationary models which are 
effectively canonical single field models. The second approximation scheme is valid when the isocurvature 
mass is heavy enough and when certain adiabatic conditions on the turn rate are satisfied. For a constant 
turn rate this results in a renormalization of the power spectrum. In case of a transient turn rate this 
leads to oscillations in the power spectrum. Both cases influence also other observables. This means that 
this approximation scheme can be used to study multi-field inflationary models which might have distinct 
signatures compared to canonical single field models. 

The two approximation schemes do not cover the full range of possible multi-field models. In particular, 
it would be interesting to get more understanding of the intermediate case with semi heavy fields rri > 9/4 
and the possibility of turns. In this case the isocurvature modes will probably decay on super-Hubble 
scales which means it is possible to make predictions which can be related to experiment. First of all 
one could look for a method to solve the perturbation equations analytically. Moreover, it is possible to 
improve our understanding by consulting numerical methods. 

In this overview we focused mainly on the case of two fields. It would be interesting to generalize the 
second approximation scheme to more than two fields where one field is light and the others much heavier, 
for two heavy fields see [54], In this case one could try to integrate out all the heavy fields which we 
expect to induce similar corrections to the final single field version as in the two field case. These cor¬ 
rections will presumably only depend on additional turn rates and masses coming from the additional fields. 

Finally, we should perform numerical studies of concrete models for multi-field inflation in order to check 
the analytical results and to improve our understanding of multi-field inflation under less restrictive con¬ 
ditions. We already made a start with this and describe a numerical study in the next chapter. 
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Chapter 4 


Numerical Study of Concrete Models in 
Literature 

4.1 Introduction 

We have seen in chapter 2 that the latest precision measurements of the CMB put inflation on a firm 
footing. Canonical single field inflation 1 can fit the current CMB data very well, however the precise 
microphysical origin of inflation remains a mystery. Single-field models could be fundamental theories but 
from a theoretical point of view they have their limitations, because they have no ultraviolet completion 
and, with the possible exception of Higgs inflation, no connection with the rest of physics [16]. We are 
interested in the case that single field inflation is an effective description of a more fundamental theory 
containing multiple scalar fields. The current precision data puts strong constraints on the possible 
interactions terms in a multi-field model. However if one can find something in the data which is distinct 
from canonical single field models, this indicates the presence of new physics during inflation and this 
might a big step towards understanding physics of the very early universe. Therefore we would like to 
understand the falsifiable observational signatures of multiple field inflation in the current and future 
data. We have seen in chapter 2 that if the detection of B-mode polarization by BICEP2 is a result of a 
tensor-to-scalar ratio of order r ~ 0(0.1) then this suggests a high Hubble scale of inflation H 10 13 GeH 
and a super-planckian excursion of the inflaton in field space. This latter implication means that the 
dynamics of the inflaton is more sensitive to features in the potential or more generally to any mismatch 
between the geodesics in field space and the valley of the potential. In chapter 3 we have seen that for 
multi-field models where one field is light and the others much heavier compared to the Hubble scale of 
inflation it is possible to integrate out the heavy fields to get an effective single field description under the 
conditions (3.44) and (3.46). When there are sudden turns in the trajectory this will result in oscillatory 
features in the power spectrum and correlated features the bispectrum [51]. In [55] a search for these 
correlated oscillations in the Planck CMB data has been performed which shows that there are hints 
of oscillatory features in the data. However the full data on the bispectrum has not been released yet. 
Interestingly, this shows that the current data might already be precise enough to detect signatures of 
new physics. This motivates us to continue the study of multi-field inflation and much can be learned by 
studying concrete models for multi-field inflation. It is highly interesting to understand what kind of turns 
appear in these models and how they will influence the observables. At the same time these models can 
be used to check if our theoretical understanding is correct. In various models of inflation in supergravity, 
inflation is embedded in a multiple scalar field theory with non-canonical terms. Because of the presence 
of non-canonical kinetic terms the trajectory of the inflaton in field space could easily be curved. We 
study some models from recent papers in the literature [56, 57, 58, 59]. In general, once the heavy fields 
are stabilized, the possibility of presence of multi-field effects due to turns is not taken into account in 

1 Canonical single field inflation is defined as single field slow-roll inflation with canonical kinetic terms, minimally coupled 
to gravity and with Bunch-Davies initial conditions. 
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these papers. We would like to study whether their naive single field truncation is valid within the current 
observational constraints or that we might be able to detect the presence of the additional fields in these 
models. Furthermore we study another model given in [60] which is a toy model for certain aspects of 
axion monodromy for which we would like to study the same question. 


4.2 Characterization of the models 

In this section we give a description of the models we have been studying. The specifications of [56, 57, 
58, 59] are different from [60] and therefore we discuss them separately. 


4.2.1 No-scale supergravity models 


The papers [56, 57, 58, 59] all provide examples of no-scale supergravity models [61] which have a Kahler 
potential of the form 

K = -31n(T + f) + ..., 


and a superpotential W satisfying 


d T W = 0, 


which imply a vanishing cosmological constant. This can be seen by deriving the scalar potential from the 
Kahler potential and the superpotential (see for example [62]) through 


V = e K ( K tt (d T W + Wd T K ) (d f W + Wd f K) - 3|W| 2 + ... 


(T + T) 2 ( —3W \ ( -3 W 


= e 


K 




-3|TK| 2 + ... 


= e K (0+ ...), 


such that we indeed have a vanishing cosmological constant 2 . Terms with additional matter fields are then 
added to the Kahler potential which determine the final scalar potential. The overall exponential factor 
e K in front of the potential will result in a tendency to minimize K and therefore the additional fields get 
probably fixed at zero which means their influence on the inflaton dynamics will be negligible. Of course 
this depends on the precise form of these extra terms and it should be checked that their influence can be 
ignored for each model. The kinetic term for the two components of the complex field T is given by 

K TT dTdT = - 3 ^^dTdT = - (di/j 2 + e ^yfW^I M vd(\> 2 \ ? ( 4 .i) 

(T + T) 2 v 7 

where we split T in a real and imaginary part because the kinetic term only depends on the real part of T 

T = + * v /2730. 

The exponent and the factor yj 2/3 are introduced in order to make the kinetic term belonging to ?/> canon¬ 
ical. We see a non-canonical kinetic term G^ = e+ 2y >/ 2 ^'^l Mp arises naturally from no-scale supergravity 
models. 


Let us now see what matter fields are added to the Kahler potential and what kind of superpotentials are 
used in the papers. In [56, 57, 58] the ‘Cecotti model’ [63] and ‘imaginary Starobinksy model’ are studied 
for which the Kahler potential and superpotential are given by 

K = —3 In (T + T-XX), 

W = 3MX(T — /), 

2 Be aware of the fact that T denotes a matter field and not the tangent vector used in the previous chapter, moreover, dr 
denotes a partial derivative with respect to the field T, i.e. dr = d/dT. 
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where it is assumed that the field X is stabilized at X = 0. However, this is not shown explicitly but a 
reference to [64] is given in which is explained how this can be realized. We expect that the value of / 
plays a role here, but after putting X = 0 its precise value is irrelevant and by a rescaling of T we can put 
/ to one, which we will do later on. Under this assumption the kinetic part of the Lagrangian is given by 
(4.1) and the potential energy density becomes 

V = 3M 2 H = -M 2 M 2 (l- + 1 M 2 e =F 2 V / 273^/ M J>oi 2 

(T + T) 2 4 p \ J / 2 

If the imaginary part of T is zero along the inflationary trajectory then this model coincides with the 
Starobinsky model. If inflation happens along the imaginary part of T instead, one will get a chaotic 
inflationary model. This is exactly what is aimed at in these papers. In order to realize this the real part 
of T should be stabilized and therefore the Kahler potential is modified. This is done in several ways and 
the Kahler potential together with the resulting Lagrangians are listed in Table 4.1. 

In [59] the ‘no-scale inflationary model’ is studied with the following Kahler potential and superpotential 

X X 

K = —3 ln(r + T) + 

V ’ (T + T) 3 ’ 

W = 3 MX(T-f), 

where by studying some slices of the potential it is concluded that the field X is stabilized at X = 0. The 
function / presumably plays a role here, where for small enough values / < 1 the exponential factor in 
front of the potential constraints the value of X, namely e K ~ /! 2 4)‘ . In this paper several stabilization 

terms are considered as parameterized by 0 which represents a line in the (ReT, ImT))-plane. The Kahler 
potential is given in Table 4.1. We do not show the resulting Lagrangian density, because we haven’t 
studied this model in more detail yet. 


4.2.2 A toy model in which a heavy field flattens the potential 

In [60] it is explained how interaction of heavy fields with the inflaton can flatten the potential. The 
motivation comes from string theory which provides many heavy scalar fields and some light scalar fields 
which can play the role of the inflaton. An example of a light field is the axion and the flattening effect 
can be realized in axion monodromy inflation. This is studied in this paper in section 3. As a toy model 
to explain the flattening effect they use the following potential 

L = 3 W + M 2 (^-^o) 2 . (4.2) 


The idea is that the heavy fields V’ will try to minimize the potential at any given value of the light field </>. 
For large values of (f> this will eat the first term and the potential gets flattened. Computing the valley of 
the potential assuming is massive enough and that it takes the value ^(c j i>) you get the following effective 
potential 

j._ M 2 ^o , _ g 2 4> 2 ^ 2 „,2 

1’ .9 29 , 71 TO „9 j.9 , IpQ. 


g 2 (j) 2 + M 2 

There are two different regimes of this effective potential 


g 2 <j) 2 + M 2 ' 


M Vg, 


H 2 


<t> > M/g : V 
(j) « M/g : V^g 2 ^ 2 , H 2 


M 2 i > 2 
3M 2 ’ 


-> 0, 


g wj 

3M 2 


4 > 2 , V* -t V’o- 


Indeed the potential gets very flat in the regime <f> 3> M/g as is visible in Figure 4.1. The bending of the 
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Figure 4.1: The potential as given in equation (4.2). The potential flattens in the regime <j> M/g. 


potential from one regime into the other regime takes place around the value </> rsj M/g. 

For the kinetic terms the following form is assumed 

C-kin = \Gu^)( d Af + 

which also captures the kinetic terms studied in subsection 4.2.1. Two specific examples mentioned in 
the introduction are canonical kinetic terms = 1 and one example of non-canonical kinetic terms 

Gfa 1 ( 1 /’) = jj/j; which is actually introduced to give an example in which there is steepening instead 
of flattening. The non-canonical kinetic term comes without motivation and is furthermore not always 
positive so one should be careful while studying this example. For now we do not worry about the physical 
relevance because we use it as a toy model to see whether there are detectable effects coming from the 
heavy field. 

4.3 Why do we expect to see something and what are the observational 
consequences? 

Before we continue describing our numerical studies we first outline our expectations. As we have seen in 
the previous chapter any mismatch between the geodesics in field space and the minimum of the potential 
leads to turns in the trajectory. In the models of inflation described in sections 4.2.1 and 4.2.2 the kinetic 
terms are of the following form 

C-kin = ^GW(^)(d^) 2 + (4.3) 

The deviation from a geodesic can occur when the kinetic terms are non-canonical, i.e. when G^/t/) 0 1 
and really depends on i/>. In the models from section 4.2.1 the -0-field gets stabilized at a particular value, 
in the model from section 4.2.2 we expect this to happen in the regime </> 3> M/g. To see intuitively 
why we expect turns in case of a stabilized -0-field, suppose we have G= i-. This corresponds 
to an interpretation of the 0-field as a radial coordinate and the 0-field as an angular coordinate. If the 
0 -field takes a constant value then it means the radius is constant and inflation happens only along the 
flat angular direction. We expect that the radial-angular intuition also holds for more general forms of 
G Y 00 (' 0 ) and therefore we expect that the inflaton will have a constant turn rate throughout its trajectory 
if 0 is stabilized, assuming e is constant. 
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Moreover in the model discussed in section 4.2.2 we also expect a turn due to the bend in the potential 
when the inflaton goes from the flat regime to the second regime. In this case we expect a turn rate which 
varies in time. 

4.3.1 Observational consequences 

We will discuss the observational consequences and detectability for both expected types of turns. In case 
of a constant turn rate and a large mass hierarchy we have seen in the previous chapter that the power 
spectrum and bispectrum will get renormalized with respect to the single field spectra (denoted with an 
index 0) 

Vtz = —Vno, 

Cs 

Bn = -?Bno- 

Therefore one could compare the relative amplitudes of the power spectrum and the bispectrum to see 
if there is a deviation from slow-roll. So far only upper bounds on non-Gaussiantity are given and with 
the current constraints a speed of sound below c s = 0.04 is ruled out 3 . If the polarization measurement 
of BICEP2 is a result of primordial gravitational waves with, then an much stronger bound of c s > 0.25 
is derived in [65]. It is hard to detect non-Gaussianity, but future measurements [66] might be able to 
detect non-Gaussianity at a level corresponding to c s < 0.15 at first sight 4 . Moreover, the tensor-to-scalar 
ratio becomes r = 16 ec s and together with a measurement of tensor spectral index m = — 2e this could 
also reveal a value of c s different from 1. Although r might have been detected by BICEP2 [31], we are 
probably not so close to a precise estimate of the value of both quantities. In the near future 5 therefore we 
probably can only measure this signal when c s is below 0.2. Therefore in order to measure the presence 
of the heavy fields we need a reduction in the speed of sound of about 80 percent. However the effective 
single field description is only valid under the assumption that there is a large mass hierarchy. In case the 
mass of the additional field is comparable but larger than the Hubble scale we cannot speak in terms of 
the speed of sound and we should compute the rescaling of the power spectrum numerically. Therefore 
more generally in order to measure the presence of the heavy fields we need the amplitude of the power 
spectrum to increase by a factor of about 5 with respect to the canonical single field prediction. 

In case of a localized turn induced by the bend of the potential we expect oscillations in the power spec¬ 
trum and bispectrum which might be well detectable in a joint analysis of the spectra in the near future 
if the turn happens within or just before the regime where the observable modes cross the Hubble radius 
and if the turn is sudden enough [55], therefore we think that any sudden turn within the observational 
regime cannot be neglected and should be taken into account. 


3 In order to see this we compare equation (97) of [35] with equation (1.1) of [54], This yields C 3 (cy 2 — 1) « | cj 2 for a 
small speed of sound for the type of models we consider, which allows us to read of the graph given in Figure 25 of [35]. This 
means that for a speed of sound of c a < 0.04 we are outside the 2<r confidence region and we find the constraint c a > 0.04. 

4 If one takes equation (98) from [35] and put in A = 1/2 • (c a — 1), which corresponds to the value of C 3 we computed in 
the previous footnote, then one can solve c a for any given value for . At the same time, considering Table 7 from [ 66 ] 
it might be possible in future to constrain with a ) ~ 10, such that an detection of c s = 0.15 or below should be 

possible. For c s = 0.15 we find = —13.6, for c s = 0.1 we find = —31. 

5 In [65] it is argued that c s = 0.47 is the critical value below which one can still differentiate between canonical single field 
models and effective theories with a constant speed of sound and that experimental improvements are necessary to be able 
to detect it. 
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Figure 4.2: A typical (a) trajectory, (b) turn rate and (c) isocurvature mass squared compared to the Hubble 
parameter for the modified Cecotti model studied in section 4.4.1. In this particular case we choose a such that 
4>b = 0.2 M p . 


4.4 Numerical study of the no-scale supergravity inflationary models 
from literature 

We first study the no-scale supergravity models from the literature without modifications as listed in Table 

4.1 and summarize our findings. 

4.4.1 The modified Cecotti model 

The modified Cecotti model is given in the first row of Table 4.1. We pick T = e +y S 2 ^^/ Mp M p + iy/2/S(p 
as in the paper which means that the ± sign in the table becomes a + sign. Note that by redefining 
the fields such that T —» fT we can put / = 1. This model leads to chaotic inflation because there is a 
potential barrier at ipb = —\/3/2 ln(2a)Af p which stabilizes the field ip at some value close to zero. We 
need a > 0 in order to have a potential barrier. Inflation is supposed to happen for ip < ipb because the 
inflaton will first go in the positive ip direction until it comes close to the barrier where it gets obstructed 
and then it inflates along the <p direction. Therefore we should take a < 0.5 such that ipb > 0 to enforce 
that the absolute minimum of the potential stays within reach of the inflaton field to avoid that we end 
up with some nonzero cosmological constant. We vary a such that ipb £ {0.1, 0.2, l,3}M p for which the 
kinetic term for <p will deviate more strongly from canonical for increasing ipb . We find however that there 
is not much difference and for each case the trajectory enforced by the potential is curved with a constant 
turn rate of order ? ?" L = Jy ~ 0.1. The isocurvature mass is about M 2 / H 2 ~ 12 and we find a negligible 
renormalization of the power spectrum. We show one example in Figure 4.2 where we choose ipb = 0.2. 

4.4.2 The imaginary supersymmetric Starobinsky model 

The imaginary supersymmetric Starobinsky model is given in the second row of Table 4.1. Again we 
put / = 1 and take the ± sign to be +. In order to have a potential barrier m has to be negative. In 
the corresponding paper [56] they choose m = — n~ 1 2 1 ~ n such that the potential is minimized at ip = 0 
independent of the value of cp. More generally parameterizing m = —c~ 1 2 1 ~ n we have a potential barrier 
at ipb = y/3/2 In(c)/(n — 1). We take n > 1 and c > 1 such that ipb > 0 in order to have inflation 
without a cosmological constant when ip gets stabilized slightly below 'ipb- We pick n £ {1.1,1.5, 2} and 
for each value of n we choose c equal to, less than and higher than n. In each case we find a very low turn 
rate r/^ = Mr <0.1 and an isocurvature mass of about M 2 /H 2 ~ 4. The results of this model should 
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Figure 4.3: A typical (a) turn rate and (b) isocurvature mass squared compared to the Hubble parameter for the 
imaginary supersymmetric Starobinsky model studied in section 4.4.2. In this particular case we choose n = 1.1 and 
c = 1.3. In (c) the corresponding power spectrum is plotted where the blue line is the full numerical solution and 
the red dashed line the single field approximation. 


therefore be reproduced by the SRST approximation in which case we expect and find indeed a negligible 
correction to the amplitude of the power spectrum compared to the single field approximation. We show 
one particular example in Figure 4.3 where we choose n = 1.1 and c = 1.3. 

4.4.3 The (R + R 2 + i? 4 )-extension of the imaginary Starobinsky inflation 

The (R + R 2 + i? 4 )-extension of the imaginary supersymmetric Starobinsky model is given in the third 
row of Table 4.1. Note that z is in units of Again we put / = 1 and take the ± sign to be +. The 

overall normalization fixes z and therefore the parameter space consists of M, m and n. The potential 
barrier stays the same as in the imaginary Starobinsky model and parameterizing m = — c~ 1 2 1 ~ n we have 
V’b = \/3/21n(c)/(n — 1). We pick n E {1-1,1.5} and for each value of n we choose c equal to, less than 
and higher than n. We get the same results as for the imaginary supersymmetric Starobinsky model: in 
each case we have a very low turn rate = jjjj < 0.1 and an isocurvature mass of about M 2 / H 2 ~ 4. 
Therefore we do not consider it further. 

4.4.4 The no-scale inflationary model 

The no-scale inflationary model is given in the last row of Table 4.1. We haven’t studied this particular 
model yet. In this case if) is not stabilized at a constant value but rather a line in the (ReT, ImT))-plane 
is stabilized. This will probably result in a turn rate which increases or decreases in time. It is interesting 
to see whether the change in the turn rate is visible in the regime when the observable modes cross the 
Hubble radius. Besides a renormalization of the power spectrum this could for example lead to a change 
in the spectral index as well. We should investigate this model in the future. 

4.5 Numerical study of a toy model 

For the first three models given in Table 4.1 we haven’t found so far interesting deviations from the 
single field approximation. We are however also restricted by choosing the value where -0 is stabilized, 
the value of the isocurvature mass and the form of the kinetic terms because we only worked with the 
particular models offered by the papers. In order to have more grip on these parameters we therefore 
continue studying a toy model. The potential must allow for tunability of the isocurvature mass and the 
value where ij} gets stabilized. Furthermore we will also include the potential which has a bend in this toy 
model. We study non-canonical kinetic terms which only depends on one of the fields 1 p i.e. G^ 
but we restrict to the forms found in literature. We can now make the research question more precise. 
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Research question 

We study a toy model described by the the Lagrangian density 

£ = + ^(<V/0 2 - v(<j>,ip), 

for which we allow the following kinetic terms 

G h = 1, G h = ip/M p and G H = e ± ^^. 

The potential energy density V(cj>, ip) is given by 

V = g 2 <p 2 + g 2 (p 2 ip 2 + M 2 (ip - ipo) 2 , 

and two interesting limiting cases are such that the field ip gets stabilized at a constant value ( g = 0) 
or such that it has a bend (g = 0). In the case g = 0 and in the flat regime of the potential given 
by g = 0: can the non-canonical kinetic terms induce a renormalization of the power spectrum with 
a factor of about 5? In the case g = 0: does the bend in the potential lead to a sudden turn in the 
trajectory in the regime where the observable modes cross the Hubble radius? 


The full parameter space of our toy model is given by {g, g , M, ipo} plus the three choices for the kinetic 
terms. Since the overall normalization of the power spectrum is given by experiment, this means the 
overall normalization of the potential is fixed and therefore this reduces the number of parameters by one. 
Moreover, as a starting point in this study we focus on the case g = 0 and the case g = 0, which means 
we are left with two continuous parameters and one discrete parameter for each case. 


4.5.1 The potential with a bend: g = 0 

We get the following potential 

V = g 2 cp 2 ip 2 + M 2 (ip - ipo) 2 , 

which is given in [60] and illustrates that the heavy fields can flatten the potential. This effect is explained 
in section 4.2.2. In the flat regime we can estimate the mass of the isocurvature mode 


Vnn ~ 2 g 2 (p 2 + 2 M 2 , 


Mnn Vnn 

-Jp w 


2 g 2 cp 2 + 2 M 2 

M 2 ipQ 


3 M 2 > 


12 M 2 


where we used that the normal points towards ip because inflation happens in the (p direction. During the 
bend the isocurvature mass will presumably decrease judging the plot of the potential shown in Figure 
4.1. However from this lower bound on the isocurvature mass we are probably safe in saying that the 
mass of the isocurvature mode depends on the value of ipo/M p : the smaller ipo/M p the bigger the isocur¬ 
vature mass with respect to the Hubble scale. Besides the overall normalization of the potential the two 
parameters we can vary are ipo and the ratio M/g. The bigger the ratio M/g the more inflation we have 
in the regime <p <C M/g and the longer it takes for the inflaton to go from ip = 0 to ip = ipo because the 
radius of the bend of the potential increases. The bigger ipo the bigger the change in ip so probably there 
will be more turning in this case. At the same time the isocurvature mass decreases, so here something 
interesting might happen already with canonical kinetic terms. 


Let us now think for a moment about the allowed ranges for the parameters. The mass scales should 
be well below the Planck scale Mpi = \/&kM p ~ 5M p . If the interesting scales cross the Hubble radius 
during the flat regime then the value of the Hubble parameter is given by ^ = yr^r, at (p = ^ it 
becomes half this value. In the flat regime e is small so in order to compensate for e to get the right 
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Figure 4.4: (a) The trajectory, (b) the turn rate and (c) the isocurvature mass squared compared to the Hubble 
parameter. All for the potential with a bend (/i = 1) with canonical kinetic terms studied in section 4.5.1 and 
0o = 3 M p and M/g = 0.1 M p . 


amplitude of the power spectrum we know the Hubble parameter must be small indeed. In the regime of 
the bend it is a bit unclear what the value for e will be but say in the most optimistic case we end up 
with H = 5 • 10~ 5 M p (corresponding to r = 0.2 and such that M0o ~ 8 ■ 10 ~ 5 AI 2 ). This means if we get 
Mnn/H < 10 6 we are safe, so we can choose 0o very close to 0, but of course every time we need to check 
that this condition is satisfied. On the other hand we would like M^ N /H 2 1 in the regime of the bend 
such that the effective description applies. If this does not hold one also needs to do a full computation 
of the bispectrum in order to understand the effects on the observables. Of course it is interesting to see 
what will happen to the power spectrum when you enter the regime M^ N /H 2 ~ 1 but that is a different 
question. We study this potential for several kinetic terms. 

Canonical kinetic terms: = 1 

We choose M/g E {0.1,1,10}M P and 0o E {0.1,l,3}M p . We find that the turn rate is either too low 
to be visible or too far away from the inflationary regime where the observable modes cross the Hubble 
radius: 60 to 53 e-folds before the end of inflation. One example where we have a (too sharp) turn at the 
end of inflation is for 0o = 3 M p and M/g = 0.1 M p is illustrated by Figure 4.4. This would have led to 
oscillations in the power spectrum - although an effective description is not possible here - if it happened 
much earlier. It makes sense that this happens at the end of inflation because if the turn in the potential 
is traversed fast enough it means at the same time that inflation ends quickly afterwards, because the 
bend is positioned close to the end of the trajectory. 

Studying the regime where the observable modes cross the Hubble radius, in each case the turn rate is 
too low compared to the isocurvature mass and you can neglect its influence on the observables. We give 
an example for 0o = 0.1 M p and M/g = 10 M p as illustrated by Figure 4.5. Considering the plot of the 
trajectory in field space this configuration might lead to a decreasing turn rate in the case of non-canonical 
kinetic terms. 

Non canonical kinetic terms: = 0/M p 

This kinetic term is not always positive so we need to be careful. If 0o gets very close to 0 the quantum 
fluctuations of the fields can render the kinetic term negative, so this is something we should keep track 
of during the numerical computations. To omit this difficulty we first study examples in which 0o comes 
not so close to 0 and we choose M/g E {1, 10}M P and 0o E {0.1,0.2,1,3 }M p . We find that if -00 gets 
smaller the 0-field will traverse larger distances in field space. This has to do with the fact that 0 will be 
very large if 0 is small, which is precisely the case if 0o is small. During the regime where the observable 
modes cross the Hubble radius we find a speed of sound very close to 1 for all cases. So the turn rate 
might be large but this is compensated by a much larger isocurvature mass. We illustrate one example 
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Figure 4.5: (a) The turn rate, (b) the isocurvature mass squared compared to the Hubble parameter and (c) the 
trajectory in field space during the 60 to 53 e-folds before the end of inflation. All for the potential with a bend 
(/j, = 1) with canonical kinetic terms studied in section 4.5.1 and ipo = 0.1 M p and M/g = 10M p . 
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Figure 4.6: The trajectory for the potential with a bend (g = 1) with non-canonical kinetic terms studied in 
section 4.5.1 with G^ = ip/M p and ipo = 0.2 M p and M/g = 1 M p . 


ip o = 0.2 M p and M/g = 1 M p in Figures 4.6 and 4.7. Furthermore we do a non-rigorous check of what 
happens if we decrease ipo and we find that we get very large turn rates for which the speed of sound 
reduces considerably. This seems to be an interesting regime and we should do a more careful analysis in 
future. 


Non canonical kinetic terms: Ga 


= e ±y/2/3rl>/M p 


In case of G u = we have a very low turn rate with heavy masses when ipo is small. If you want 

a bigger turn rate you need to increase ipo to a rather high value such that the exponential becomes small, 
but then you will only get some effect at the end of inflation and moreover the isocurvature mass becomes 
extremely low, so this is not what we are looking for. In case of G^ we always have a 

small turn rate because ip > 0, so here you only get c s = 1 to a very good approximation. Conclusion: for 
the kinetic terms G^ = e ± V /2 / 3 b/ M p single field description is fine as long as the isocurvature mass is 
not small. 


4.5.2 The separable potential which stabilizes ip at a constant value: g = 0 

We put g = 0 to get the following potential 

v = - V’o) 2 , 

where one held is stabilized such that inflation is only along one of the fields. The potential is separable 
and therefore much easier to understand. We take M large compared to g such that ip is the heavy 
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Figure 4.7: (a) The turn rate, (b) the isocurvature mass squared compared to the Hubble parameter and (c) the 
speed of sound. All for the potential with a bend (/j = 1) with non-canonical kinetic terms studied in section 4.5.1 
with G 00 = ip/M p and -00 = 0.2M p and M/g = 1 M p . 


field which gets stabilized at the ‘naive’ value ip = tpo. If due to non-canonical terms the turn rate is 
large we expect that the inflaton gets pushed uphill such that the ‘naive’ computation underestimates the 
amplitude of the power spectrum for scalar perturbations. 

Assuming that 0 gets fixed at some value close to V’o we know the normal points in the ^-direction and 
we find 

V NN = M 2 . 

Furthermore for any non-canonical kinetic term of the form G </></> (VO we can deduce in the slow-roll regime 

; = 2JV/2 

■ G h V 

where we used the fact that in this case we have <r ; = y/G^cp’ and plugged this into equations (3.3) and 
(3.6). For constant ij.’ we can integrate this equation to find 


cP(N) = 


l2(N-N e + l) 


G, 


M, 


H> 


pi 


where (p e = such that e = ^, p = 1. This gives the following estimate for the Hubble parameter 

5C0 2 


H A 


3 G, 


at the time where the observable modes cross the Hubble radius. We are therefore interested in the ranges 
of the parameters for which 


50 


3 G, 


H 


H < M < M p . 


For chaotic inflation we have 


H « 5 • 10 -6 Mp fit* y/Gfaf, ■ 10 ~ 6 M p . 


For given kinetic terms the normalization of the power spectrum fixes /i and we can vary the parameters 
M/fi and ipQ. 


Non canonical kinetic terms: G^a, = ip/M p 

We choose M/(i € {50,100,200,300,1000} and 'ipo € {0.01,0.1, l}M p . The speed of sound gets only 
reduced considerably in the case V’o ~ 0.01. For example for ipQ = 0.01M P and M //i = 1000 we hnd 
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Figure 4.8: An example of (a) a trajectory, (b) the reduced speed of sound and (c) the displacement of ip (red 
dashed line) compared to ipo = 0.05 M p (blue line). All for the toy model with the separable potential (g = 1) with 
non-canonical kinetic terms studied in section 4.5.2 with = ip/Mp , ipo = 0.05 M p and M/fi = 300. 


M^n/H 2 ~ 350 and c s ~ 0.85. However this kinetic term becomes negative for negative values for ip 
including the quantum fluctuations so we should do a more careful analysis in this case to check whether 
this example is reliable. For ipo = 0.1 the turn rate is about g 2 - ~ 0.6 and therefore for M 2 /H 2 1 this 

effect is certainly not visible. The constant turn might cause some trouble when the isocurvature mass 
gets closer to the Hubble scale. For example taking ipo = 0.1 M p and M/g = 50 we get M^ N /H 2 ~ 8 
and r] 1 - ~ 0.6. The isocurvature mass is expected to be high enough for the isocurvature modes to decay 
quickly after Hubble radius crossing but the effective theory does not apply. We can compute the power 
spectrum for this case, but we don’t know precisely what will happen to the bispectrum because we cannot 
speak in terms of a speed of sound. On the other hand if we just compute the speed of sound as if we 
are working with an effective single field description it is about c s ~ 0.9 which is too close to 1 to be 
detectable. We show one example where we choose ipo in between the values discussed in this section 
in Figure 4.8 where we take ipo = 0.05 M p and M/g = 300 such that M^ N /H 2 ~ 125 and c s ~ 0.975. 
We also show the displacement of ip from the ‘naive’ value ipo■ The resulting power spectrum is shown 
in Figure 4.9 and compared with the theoretical expected power spectrum given in section 4.3.1 and the 
‘naive’ single field approximation. We see the power spectrum gets renormalized by a few percent which 
is not detectable but motivates us to explore what happens for lower values for ipo in future. 


Non canonical kinetic terms: Ga 


= ±y/2jZiP/M p 


In case of = e VV/ m p we have a very low turn rate even when ipo takes the values such that 
the exponential becomes equal much less than 1. The same applies for G^ = e^ 2 ^^^ Mp . We conclude 

that for the kinetic terms G^ the single held description is fine because the turn rate is 

extremely low. 


4.5.3 The full toy-model 

Let us think for a moment about the full potential 

V = /rV + £W + M 2 (iP - iPof. 


Assuming ip is massive enough and that it takes the value ip(<p) you get 

M 2 ip 0 jr , A T , g 2 (p 2 


ip = 


g 2 (p 2 + M 2 


V(<P,ip) = 


M 2 ipo + mV- 


2 / 2 


g 2 (p 2 + M 2 
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Figure 4.9: The power spectrum for the toy model with the separable potential (g = 1) with non-canonical kinetic 
terms studied in section 4.5.2 with G^ = tl>/M p , ipo = 0.05 M p and M/g = 300. The blue line represents the full 
numerical solution. The red dashed line is the theoretical expected power spectrum including the renormalization 
due to the turn as explained in section 4.3.1. The green dashed line represents the ‘naive’ single field approximation 
in which the correction due to the turn is not taken into account. 


We have again the two regimes 

<j> > M/g : V -> M 2 V>o + g 2 ^ 2 , 


H z 


M 2 i/> o + g 2 (j) 2 


3M 2 


^ -> o, 


cj) < M/g : V -> (g 2 ip o + g 2 ) 4> 2 , H 2 9 4> 2 , 


V’o- 


The potential still gets flatter in the regime 4> 3> M/g but always has the additional chaotic term g 2 4> 2 . 
The reasoning applied to the non-canonical kinetic terms G^ = to the model without g 2 4> 2 

should be still the same so we won’t explore that part of parameter space. In case of G^ = i/j/Mp the 
turn rate might get cranked up so that is something to check. In case of canonical kinetic terms we again 
expect for the same reasons that the turn in the potential is either too mild or otherwise it will be close 
to the end of inflation. But we should check this because the additional chaotic term will speed up the 
inflaton. This region of parameter space should therefore also be explored in future. 


4.6 Conclusion and Discussion 

In this chapter we have addressed the question 6 whether certain types of two-field inflationary models, 
one described by a potential with a bend and the other described by a potential which stabilizes one of 
the fields and where we allow for non-canonical kinetic terms, will lead to observable signatures in the 
CMB data, distinguishable from canonical single field inflation. Based on the explored part of parameter 
space so far we arrive at the tentative conclusion that the single field approximation without taking into 
account the turns is fine and the heavy degrees of freedom present in these models won’t be detectable in 
the current or near future data because: 

1) The turn rate is constant but too low to be distinguishable from a single field model 

2) There is a sudden turn but it happens close to the end of inflation and therefore it does not influence 

the observable modes. _ 

In particular we found no sizable turn rates for the non-canonical kinetic terms G^ = e ± V^7^’/ M p and 

6 The research question is more precisely posed in section 4.5. 
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the single field description ignoring the influence of turns seems to be very good as long as the isocurva¬ 
ture mass is large enough. However for the kinetic term G^ = ip/M p we found hints of large turn rates 
which should be investigated in more detail in future. Moreover although the studied models cannot be 
distinguished from single field models from an observational point of view so far one should actually take 
into account the rescaling of the tensor-to-scalar ratio r when computing the observables in order to study 
the validity of the particular model under consideration. 

The data analysis performed in this chapter has not finished yet, however. In order to arrive at a more 
precise conclusion we need to explore more regions in the parameter space as given in section 4.5 for which 
we expect that it could describe models which allow for observable effects of the additional fields. For 
example in case of the non-canonical kinetic term G^ = ip/M p we should explore what happens if we 
choose V'o closer to 0. Another possibly interesting region in parameter space is when we consider the full 
toy model where both g and g are nonzero. 

Furthermore, going back to the original study of the models in literature outlined in sections 4.2 and 
4.4 we have not studied the ‘no-scale inflationary model’ yet. This model might reveal detectable effects 
because the stabilization used in the respective paper is different than the stabilization used in the other 
models from literature and the toy model we studied in section 4.5. 

Finally in section 4.2.1 where we derived the two-field Lagrangians from the Kahler potential and super¬ 
potential we assumed that the field X can be stabilized at X = 0 and that we can neglect them in the 
final theory. For the same reason that turns can reveal the presence of additional fields we should carefully 
check whether this field is really stabilized there. 



Chapter 5 


Conclusion and Outlook 


5.1 Summary of Conclusions 

In this thesis we have studied two projects. The first project, as discussed in chapter 3, was to compare 
and translate between studies of multiple field inflation in the literature. The notation and definitions 
used in these studies differ and therefore we provided dictionaries to translate between the papers. These 
dictionaries can be found in Table 3.3, 3.4, 3.5 and 3.6. We stressed that the turn parameter is very impor¬ 
tant to describe the multi-field effects and should be considered separately from the slow-roll parameters. 
However, historically the slow-roll parameters and the turn parameters are mixed up as can be seen in 
Table 3.3. Moreover, we studied the approximation schemes used in the papers which are used to solve 
the perturbation equations. We find that only two different approximation schemes are used and we study 
their regime of validity and the analytical predictions of the power spectrum of the curvature mode. The 
overview of the approximation schemes can be found in Table 3.7. We concluded that the SRST approx¬ 
imation scheme is very limited in the sense that it can only be used to describe multi-field inflationary 
models which are effectively simple single field models. The second approximation scheme on the other 
hand, which is a combination of the slow-roll approximation and a assuming a large mass hierarchy, can 
be used to study multi-field inflationary models which might have distinct signatures compared to simple 
single held models. 

The second project, as discussed in chapter 4, was a numerical study of concrete models of multi-held 
inflation from recent papers in the literature. We investigated whether the current and future experiments 
might be able to detect the presence of the additional helds in these models or not. We arrived at the 
tentative conclusion that the single held approximation without taking into account the turns is hne and 
the heavy degrees of freedom present in these models won’t be detectable in the current or near future 
data for two possible reasons. Either the turn rate is constant but too low to be distinguishable from a 
single held model, or there is a sudden turn but it happens close to the end of inflation and therefore it 
does not influence the observable modes. 

5.2 Outlook 

There are many possibilities of new projects as a continuation of this master’s research. First of all the 
two projects discussed in this thesis leave much room for improvement. Moreover, there are extensions to 
these projects which place this work in a broader research area and connect it to future developments. 

The numerical analysis of multi-held models described in chapter 4 is still in its infancy. The analysis has 
not saturated all possibly interesting possibilities yet. Some regions of parameter space might reveal in¬ 
teresting physics and should be explored in more detail. In addition, several aspects of the original models 
on which the data analysis is based, need to be studied more carefully. Moreover, we should continue our 
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quest for multi-field effects in concrete models to understand what kind of turns appear and how they will 
influence the observables. 

The analytical study of multi-field models in chapter 3 does not cover the full range of possible multi-field 
models. In particular, it would be interesting to get more understanding in case that the additional fields 
are heavy enough for the isocurvature modes to decay, but where the effective theory is expected to break 
down. By should study these type of models by means of an analytical analysis in parallel with a numerical 
study. Moreover, we focused mainly on the case of two fields. It would be interesting to generalize the ana¬ 
lytical expressions to the case of a multi-held theory containing one light held and the others much heavier. 

As an extension to these projects, it is highly interesting to learn how to search for signatures of turns in 
the CMB. Assuming an effective single held description with a reduced speed of sound, we should be able 
to match different shapes of the speed of sound to the current and future CMB data as in [55]. When 
more data is released we should be able to falsify models and hnd support for models consistent with 
observation in order to learn more about the physics of the early universe. 

Another important probe of cosmology in the near future will be high precision LSS surveys, e.g. Euclid 
[36]. These surveys will provide a huge amount of cosmological data and are the most promising datasets 
to establish the presence of non-Gaussianity. The next years it will be worth to specialize on many aspects 
related to LSS, considering the fact it will become the main probe for cosmology. For example, the 
interpretation of the measurements comes with some complicating issues of theoretical and astrophysical 
character and it is important to get more theoretical understanding about these problems. In addition, in 
order to deal with the complexity of the data it will be extremely useful to consult numerical methods. 
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Appendix A 

Full Computations of the Analytical 
Approximations 


In this appendix we work out some of the underlying computations of the analytical approximations 
discussed in chapter 3, section 3.3. The names of the sections in this appendix are the same as the 
corresponding subsections in section 3.3. 


A.l Sub-horizon 


We work out the details of the approximation of the low and high frequency in the second part of the 
sub-horizon regime M 2 3> k 2 / a 2 2> H 2 where we assume a constant turn parameter. The perturbation 
equations to zeroth order in slow roll become 

- 2H 2 ^J q T + 4 fH 2 q N = 0, 

if + Hf - 2r) ± Hq r + ^ + H 2 (m 2 - 2 - (r^) 2 )^ q N + 2 fH 2 q T = 0. 

Using the following ansatz for the solution of the perturbation equations 

q T = A _ e -ifdtw- +A+e -ifdtw +j 
q T = B _ e -ifdtu _ + B+e ~iidtu + ^ 

we get the following equation for oj± and the amplitudes A± and B± 

f—u± — ioj± — iHuj± + || — 2 H 2 2f (— iHu± + 2 H 2 ) \ ( A±\ _ 

y 2 f ( iHuj± + H 2 ) — oj± — iu± — iHuj± + || + H 2 (m 2 — 2 — (f) 2 )J \-B±/ 

Assuming we can neglect oj±, Hoj±, H 2 and fH 2 with respect to the other terms we get the following 
solution 


q T + Hq T + 2fHf + 


w± = ^2 + \h 2 (m 2 + 3(f) 2 } ± j-\/if 4 (m 2 + 3(f) 2 ) 2 + 16— ^ ^ H , 
or 2 V / 2 V a z 

-2 ifHuj + 




f-k 2 /a 2 + ’ 


—2 if Hu¬ 


ll 2 {no 2 — (rjf 2 ) + k 2 /a 2 — f 
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and expanding the square root to first order, using m 2 + 3(r/ 1 ) 2 3> k 2 /a 2 H 2 and assuming |77 _L | < m, we 
get 

4 ss % + \H 2 (m 2 + 3(^f) (1 ± 1) ± 4 
± a 2 2 V ' J v ’ a 2 (m 2 + 3(?/- L ) 2 ) 

which is in agreement with the assumption that 6j±, Hoj± can be neglected with respect to the other 
terms, since = —Huj- and Co + <C —Huj + and k 2 /a 2 S> H 2 . 

A.2 Super-horizon 

We derive the solution of the toy model with constant turn rate rj 2 - to zeroth and first order in slow roll. 
To zeroth order the perturbation equations become 

q T "--y = 2„-T V'- VV 

\ 2 : / 


We use the ansatz q 1 = A 1 z p to solve the equation, this yields 


1 (p{p~ 1) - 2 


— 2r]- L (p — 2) 
, , (M 2 o 


z 2 l 2r/- L (p+ 1) p[p- 1) + (^2 - 2 - (t?^) 2 ) i V AJV 


The determinant should be zero 


(p-2)(p+ 1) fp 2 -p+-^2-2 + 3(??- L ) 2 j =0, 


providing us the solutions 


= Az‘ 


T A 

Q = ~ 

z 

and 

q n = 

0, 

4 

II 

to 

and 

q N = 

M 2 

m 

+3(?7 J “) 2 J 

and 

I T = 

3 ± 


-&1 T 

-(1P 2 ’ 


We have only one growing mode p = — 1 which corresponds to IZ ~ 1 and freezes out after horizon crossing. 
The other solutions are decaying solutions if the isocurvature mode is massive enough, with its effective 
mass squared given by M 2 + 3 [p^) 2 H 2 . Note that they are also oscillating rapidly. In order to make more 
clear what these solutions mean, the full solution is given by 



Where A a , B a ~ 5 „ and C a , D a ~ 6^ because when the modes are decoupled (r/^ = 0) this is true. 
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Next we repeat the same computation but then to first order in slow roll. The equations of motion 
for the perturbation equations become 


Til 2 + 3 €h t r, _L / 1 + e H Nt 2 + 3en N 

Q -o —1 = -9-o — q 

\ z 


z* 

NH , 1 + 2e H 

H I o 


z*■ 


( 2 o I / lo\ JV o±/ / l+ e ^T/ | l + ?A 

ym - 2 + e H ~ (f? ) J q = -2?? 1 —-—g H- ^—9 ) 


Using the ansatz q 1 = A 1 z p and assuming first order slow roll and expanding to second order in e the 
matrix becomes 


p(p- 1) —2-3e—6e 2 -2ry x (p-2+(p-3)e+(p-4)e 2 ) 

2r+ (p+l+(p+2)e+(p+3)e 2 ) p(p-l)+(m 2 -2-(j ? - L ) 2 )+(-2(» ? - L ) 2 +2m 2 -3)e+(-3? ? 2 +3m 2 -4)e 2 I \A N 


where we used m 2 = M 2 /H 2 . The four solutions are given by 


^ = 0 , 


11 (r/ ± ) 2 +m 2 2 

p = -1 - e - W '„ —^re and 


4?y J 


p = 2 + e + 


SO? -1 ) 2 + 3m 2 

11(t7“ L ) 2 + m 2 2 


e and 


4 T 9 (? 7-*-) 2 + 3 m 2 

A n 6/7 -1- 


9(r/- L ) 2 + 3?n 2 A T m 2 — (p 2 -) 2 ’ 

p= l(l±JUT (3|7f + =F + 2m ' 2 ~ 3) c + 0 </) 

2 \ V 7 v / 9-4(3(^) 2 + m 2 ) 

j4 t 477- 1 - 4r/ J 


and 


3 ± \/9 — 4 (3(?7 J -) 2 + m 2 ) 79 — 4 (3(??- L ) 2 + m 2 ) ^3 ± 79 — 4 (3(??- L ) 2 + m 2 )^ 


+ 0 ( 7 ), 


where we expanded the last solutions only to first order in e because it became too messy and in case 
of sufficiently high masses these are decaying solutions anyway. The full solution to linear order in e is 
therefore given by 

£ = % (Az- 1 -* + B***) 

Ap 2 - 


+ 62 c - 7 * T/ (1 + =1 / 

3 + 79-4(m 2 + 3(^)2) l 79-4(m 2 + 3(? ? ± ) 2 ) I 


+ ^D 


4?7 J 


1 - 


3 — 79 — 4(m 2 + 3(?7 j -) 2 ) y 79 — 4(m 2 + 3(?7 J -) 2 ) 


l + 7 9 _4(3(^) 2+m 2))_ 7+) 2 + 2 - 2 —3) 


/9-4(3(r7- L ) 2 + m 2 ) 


N xT I id ^ 2+e 


+ <57 l CZ 


\ ( 1 + 7-4(3(,+ )2 +m 2)) _ _L 


(6(ij- L ) 2 +2m 2 -3) 


9-4(3(rj J -)2+m2) 


|(i-- v / 77 3 (77+7 2 ))+-7 


(6(»7 X ) 2 +2m2-3) 


9-4(3(?j i ) 2 +m 2 ) 


to first order in slow roll. Note that we find the same growing and decaying solutions as before and in 
particular that the sourcing of the isocurvature mode by the growing curvature mode is suppressed by e 2 . 


A.3 Transition 


In this section we provide more details of the computations made in section 3.3.3. 
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A.3.1 Case 1: effective single field description in case of a large mass hierarchy 

We derive the effective perturbation equation for IZ , the adiabatic conditions under which the effective 
theory is valid and the computation of the predicted power spectrum. 

Effective perturbation equation for IZ 

As explained in section 3.3.2 we integrate out the heavy modes when z < m. The low frequency solution 
allows us to neglect the time derivatives of S which gives 

+ 77)2 “ ( r ? ± ) 2 + Vh(vh ~ 3 + e) - i]'^ S = 2 '^TV, 

here the semi-colon represents a derivative with respect to number of e-folds N, resulting into the following 
equations of motion for IZ 


WT 7 + 3 - e - 2r) H ) I IZ' + 


4 (v 


-U2 


. dN ' 111 J V ' + 777,2 “ C 7 ?" 1 ) 2 + Vh(vh - 3 + e) - rj' H 

Defining the speed of sound c s as 

4(h" L ) 2 


-K- + 


k 2 


a 2 H 2 


IZ = 0, 


1 


= 1 + 


Jjj? + m 2 - (r / 2 -) 2 + r) H (r] H - 3 + e) - rj' H ’ 


we get the following effective equation 


1 


Cs 


W +{3 -e-2nB-2-y*-> a2H2 


Ik 2 


IZ 


= 0. 


Rewriting this as 


a 3 eH \ c 


IZ'a 3 He\' (? 2 

- —d 2 IZ = 0 

a 2 H 21 ’ 


allows us to derive the following effective quadratic action up to a constant 


r2 / a eH 


S’eff ~ Mi / dNd a x 


C TZ'f - 


a 2 H 2 


{diKf 


where the square of the Planck mass is included because of dimensional analysis. Using our knowledge of 
canonical single field inflation the constant of proportionality is equal to unity. 


Adiabatic conditions 

We assumed that we can neglect the derivatives of S in the equations of motion, which means the effective 
theory is only valid if 

+ m 2 - S , 


3|cS : | < 

IS”I « 


( a 2 H 2 
( k 2 , 2 


| [aW 2+m " (?? ] ) S ■ 

Using IZ' ~ ^jjTZ and S = ^(cj 2 — 1)IZ : and (-4^)’ « — ^ these conditions translate into 

k 2 


( cr 2 -i y + JL_ I 


cj 2 - 1 ' aH 


< 


+ rn z — (rj L ) 2 


a 2 H 2 


(c s 2 ~ 1)” k 2 ( k \ (c s 2 -l)’ k 

^2 -— + ^77? + 2 — - 1 - L _ 3 + i 

c s 2 — 1 ci 2 H 2 \aH J c s 2 — 1 aH 


< 


+ m z — (r/ -1 ) 2 


a 2 H 2 
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where we neglected the slow roll parameters. In addition we can use m 2 2> ~rjp 1 and denote 
u = 1 — cj 2 to simplify the condition to 


U' 

u 

K< \ 

m 2 

- C^) 2 

u ;; 

+2 ( 

k 

\ U’ 


aH 

- 1 — 

u 

/ u 


< 


±\2 


rrr — (/p) 


the latter condition implies that if the following adiabatic conditions are satisfied the effective theory is 
valid 


U’ 

u 


< min 



m 2 — (?/-*■ ) 2 \ 

\2k/aH — 2| J ’ 


It” 

u 


< 


m 


(7 1 ) 2 


independent on the fact whether u is small or not. It is also possible to express everything in term of the 
turn parameter and its derivatives. We have 

2^(1 + «), 

u 

— PS —2£ J_ ’(1 + u) + 4(^- l ) 2 (1 + 3u + 2 u 2 ), 
u 

where we neglected derivatives with respect to m , -Jjj and the slow-roll parameters. Now when a < 1 we 
get therefore approximately 


< \J\m 2 - (j?- 1 ) 2 ! 
<C |m 2 - (j ] L ) 2 1 . 


These conditions ensure that the timescale of the duration of the turn is much bigger than the period of 
the oscillations about the flat direction of the potential. 


Power spectrum 

If the adiabatic conditions are satisfied it is possible to solve the effective single field equation (3.41) for 
IZ. We consider the two cases of a constant turn rate rj 1 - for all times and a transient turn for which 
c s is assumed to go to 1 for the limits r —> —oo and r —> 0. Assuming a constant turn rate, the initial 
conditions for IZ change as explained in section 3.3.1. The low frequency as defined in equation (3.37) 
becomes 

2 _k 2 ( 4(r/- L ) 2 \ _ k 2 c 2 

a 2 V m 2 + 3(r^)V ~ a 2 

where the latter estimate relies on k 2 /a?H 2 <C m 2 . To see this invert the expression for the speed of sound 


c 


2 

s 


= 1 - 


k 2 

TUI 2 


4 (? 7 _L ) 2 

+ m 2 + 3(r/- L ) 2 


ql = (A-) a e i k dtuJ ~ + (A+) Q e^ dto +, 

ql = 


The solution is written as 
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The quantization conditions determine the normalization and can be used to approximate A _ and B+ 


(A-)a = 


1 


y/2cwJZ 


5a and {B + ) a = 


1 


y/2au + 


N 


where we assumed that A + and B- do not contribute much to the amplitude of q r (and therefore assuming 
that at the beginning of the second part of the sub-horizon regime ( 77-*-) 2 <C m 2 .) If we compute an effective 
theory we neglect the rapidly oscillating a = 2 modes, this leads to the following effective equation as 
function of conformal time 


q" + I fc 2 c 2 - 


aPe \ c 


o,\fi 


Q = 0 , 


with q = qj/c s . The initial conditions are given by 


qi = 


c s 


v/2 k 


c s 


and q[ = 


sV2 


In case of a more or less constant turn rate we can neglect the time variation of c s (also the k 2 /a 2 H 2 part) 
and defining the time variable dr c = c s dr the equation becomes 


5 + ^- 


1 d 2 (a ^/e) 
ay~e dr 2 


q = o, 


which is the Mukhanov-Sasaki equation. To zeroth order in slow-roll the solution is given by 

e ikc s T 


qk = 


\j2kc s 


which yields 

On super-horizon scales krc s 

where we used 


Tlk = 


c s 


1 


1 - 


fcrc, 


Akc*r 


\[2eaMp \f2kc s 
0 we therefore get 

1 1 


1 - 


krc < 


Llk = 


H 


\f2eaMp iy2k ?J cjr i\J2ek 3 c s aMp' 


1 


T~L — (1 + 6* + e*), kr c — (1 + £* + €* 


This yields the following power spectrum 

Vp = 


H 2 _ 

8e*7 r 2 c s Mp c s n ° ’ 


which is about — times the power spectrum Vpo one would achieve when naively computing the truncated 
single field version. Actually one should also take into account the change of H * and e* in order to compare 
them. When c s < 1 the modes will cross the horizon earlier which means that H * becomes bigger and 
e* comparable or smaller than the single field version. This means the power spectrum will be enhanced 
even more. We can make an estimate how much H * changes 


H 


*i 


H* o — H'^XN ~ -ff*o ( 1 + c 


1 -c s 

Cs 


which is indeed negligible when c s is close to 1. Furthermore we can compute the spectral indices and r 

_ dlnVp _ dlnVp dN _ -2e + 7j H „ , 

Hs ~ dink ~ dN dink 1-e ~ C 
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d In Vt —2e 

nt = “ _2e ' 

r = 16 e*c s = c s r 0 , 

where we used that 

Vt=2 ( • 

A.3.2 Case 2: full solution to the perturbation equations using the SRST approxi¬ 
mation 

We derive the diagonalization of the perturbation equation for q, the constraints on the validity of the 
approximation scheme and the prediction of the power spectrum. 

Diagonalization of the perturbation equation q 

We start from the equation 


D 2 T q a +k 2 q a +D a b q b = 0, D a b = a z H 


2 TT 2 


'-2 + e - rj H {~ 3 + e + r) H - £ H ) + (r/^) 2 ij 1 - (3 - e - 2 rj H - C 1 )' 
(3 - e - 2 Vh - -2 + e + ^ 


During transition which is only a short period of time we can take the first order slow roll approximation 
(e = €%. r/H, S,h = 0). Assuming in addition the first order slow turn approximation (//^ « const, £-*- = 0) 
we have 


O a 

a 2 H 2 


-2 + e + (j?- 1 ) 2 

. 7 ?“ L (3 — e) 


V (3-e) 
-2 + e+ ^ 


which is approximately constant and therefore it can be diagonalized to 

{U~ l OU) a b 


a 2 H 2 


1 


—2 + e + m 2 0 

0 —2 + e + m+ 


m ± = ^\ (^) 2 + 


m nn 

H 2 


'4(3 -e) 2 (r/^) 2 + 


^NN 

H 2 


- tn L ) 2 


u = 


cos 0 — sin 0 

sin 0 cos 0 


tan 20 = 


2r/ ± (3 - e) 

(r? ±) 2 _ ^ 


Defining 
we get 


,/(t) = (tz-'lVM. 

Z) 2 + A 2 + a 2 H 2 (—2 + e + m 2 ) 0 

0 D 2 + k 2 + a 2 R 2 (—2 + e + rn 2 . 


u 


N 


= o, 


(A.l) 


Constraints on the validity of the approximation scheme 

In the papers [44, 38] equation (A.l) is solved by Hankel functions. However, this equation is only 
equivalent to the Bessel equation if D 2 = d 2 , which is true if there are no turns during the transition 
period. Therefore in general we actually should start from equation (3.22) and diagonalize it such that 
we can solve it with Hankel functions. For two fields R is defined by the following equation 

R! = aHrj ± {° 1 ~ 0 1 )R 
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Define 


then 


rr rN 

a(r) = / dfaHr/ 1 - = / dNr) A 

Jt- Jn- 


R(t) = R(T-)e 
We introduce a new variable 


Q(r) (? oj = 


= R(t- 

4(t) = 


cos a(r) — sin a(r) 
sina(r) cosa(r) 


The equations of motion for q k become 


Qk + k q k + Lljq k — 0, 


with 


Vj{t) = 


cos Aa sin Aa 
— sin Aa cos Aa 


) n ( T ) ( 


cos Aa — sin Aa\ 
sin Aa cos Aa J 


i 


A a = a(r) — a(ju). 


(A.2) 


J J 


Using the variable substitution z = —kr we get 

d 2 ql 




dz 2 +® k+ fc2-&- O' 


Comparing this to the Bessel equation 


2 d 2 y dy 2 n 

z ds +z TU (z ~^y= 0 ’ 


which becomes for the variable x = \fzy 


d 2 x 
dz 2 


+ x - 


/3 2 ~! 


x = 0. 


We can identify 


X - ql : 




4^ (1 — e-n) 2 a 2 H 2 ’ 


where we used a 2 # 2 ~ (1 + 2ey)k 2 / z 2 . Now diagonalizing f3 yields the two Hankel functions as solutions 
to this equation. But this identification only works when (3 is a constant matrix, which means that Aa 
should be constant and therefore approximately zero during transition. This corresponds to neglecting 
the turns which allows us to use equation (A.l) with D 2 replaced by d 2 . We can make an estimate of 
how small the turn must be such that this still a reasonable approximation. Using the first order slow roll 
approximation we can derive 

HKkeV-W-™, 


therefore N- and N .(_ are bounded by 


Hi < k 2 
H 2 + < k 2 


N- <N n - 2.3 
N + > N H + 2.3. 


We have the following upper bound of the absolute value of Aa at any time during transition 


|Aa(iV)| = 


r N 


lN n 


dNi] J 


<|(JV-Ah)|| ?? j 
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or if I] 1 - has approximately a Gaussian shape with its dispersion a about two e-folds or less we have the 
estimate 

|Aa(A)| < cr\/ / 27r|?7 _L | max* 

If we take the beginning and the end of transition 2.3 e-folds from horizon crossing and demand that 
|Aa(IV)| <0.1 at any time during transition we find the following bound on the turn rate 


W 


Ir/^j < 0.04, if rj^ is more or less constant during transition, 

| max < , if rj~ L has approximately a Gaussian shape with a < 2. 

a 


This should agree with the assumption of slow turn. Moreover we still need to assume that (3 is constant 
which in addition demands that iY is constant because for example 


M' t = V TT = <r{V T + ifV N ) = H\ifY + H 2 0(e, vhYh). 

Therefore, the validity of the usual solution in terms of Hankel function depends strongly on whether the 
following conditions are satisfied or not 


e,rj L ps const and |r/ J_ | < 0.04. 


(A-3) 


Predictions 

If these conditions are fulfilled the solutions of the equations of motion for uJ are given by a linear 
combination of the Hankel functions of the first and second kind 


< = V~zH)\z)Ai + Y~zH I \z)B I al 


(A.4) 


with 


+ 9 +3eH + 9 ( „ 


-U2 


4 ' (1 -e H y 4 

,2 


m* 


-U2 


/,jvy2 _ 1 . l-en-ml 9 2 0 (V j 

(u ) --+ {1 _ 6h)2 ~^+3e ff -(l + 2 e H )m -9-^-, 


where we assumed first order slow roll and w? = = ©(lO)?? -1- or m 2 2> rj 2 -. To hrst order in slow roll 

u N is not imaginary as long as 

| — 4(? 7 _l ) 2 . 

The constants A J a and B : a in equation (A.4) have to be matched to the initial conditions for q J a when 
z = 1. The Hankel functions can be estimated in this limit by 


s/zH£\z) = 2 -^/ 4 ), 

VzH^Yz) = 2 -^/ 4 ) ? 


A T a = 0, B T a = J- (cos OSl + sin 96%) e~ iv ^ 2 ~ iX , 


A-a = o, B% = J± (cos < - sin 95 T a ) , 


This yields 
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with A = A — 7r/4. At the end of transition we can use the limit z< 1 and the second Hankel function 
can be estimated as 

^ (i)^ i/2 + ^Tt )] (ir 1 * 

Note that for r = 3/2 we get the familiar solutions z _1 and z 2 . Making use of the fact that ~ 3/2 we 
can neglect the second term for u T and we find 

<(*+) = \/J(co S f»€ + si 


Ua M = \l^ (cos 06% - sin 0<£) e 


TV 


-&»W 2 -iA ( «/ 2T(v N ) / 2 W 1/2 v^(l - icot(jri/ N )) / 2 ' "" 1/2 


V 


7r 


*+/ 


+ 


r(! + ^) W 


Transforming back to we get 


9a («+) = \j^/- (c°s 2 + cos 0 sin 95^ ) f(v T ) 


i/ t -1/2 


7T ( ( 2 A^" 1 / 2 / 2 X -v N -l/2' 

+ \l Jj: (cos 0 sin 68,% - sin 2 95l) f f{v N ) J + g{y N ) 


Qa ( z +) = (cos 2 05„ - cos 0 sin 0<$£) ^/(^ JV ) 
+ (cos 0 sin 95u + sin 2 08 ^ ) f(v T ) 


v N — 1/2 


+ 0(0 (- 


2 \ 


-i/ JV -l/2 N 


2 +/ 


v T —1/2 


Now there are multiple possibilities. If m 2 <C 1 then v N ~ 3/2 and we can neglect the decaying solution 
z v n +i /2 ag we j^ the first term is obviously a growing solution and this might produce a large spectrum 
for the isocurvature modes. If m 2 > | then the real part of v N becomes zero and both terms z ±l/ +1 / 2 go 
like z 1 / 2 and these modes decay like massive modes. We consider both cases separately. 


If m 2 > | 3> r/ 1 - we can approximate cos0 = 1, sin0 = n ^ ^ and 1/7 = | + e* such that we can derive 
the following expression for the curvature and isocurvature perturbations 

*«<*> - H{l ~: 7 U Cu) («- ! ^ i +) (^r 

- g(1 ;:^ Ce,) (- ! ^«) (£)“ ^ iwi ^ 
with C = ~ 0.03649. This yields the following power spectra 


Tn 


V s 


H * (1 + 2 (C - l)e*) (1 + 9 


8 e*TT 2 Mp 


nr 


Hi 


8e*7r 2 Mp 


(1 + 2(C - l)e*) 


9(0: 


,-L\2 


J7T 


If 77 + < lwe can approximate v 1 = 1 + = | + e* such that we get 


0^(*+) = \/(3/2 + e*) ^ 


1+e* 


7T „ ' cT / 2 


9a(^+) = \/^i/( 3 / 2 + +) 5 a v , + 


1+e* 
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The curvature and isocurvature perturbations become therefore 


±LN s t (!_)’■ 


S Q (z + ) 


ka’’\/2k 

H(l-e*)(l + Ce*) 
ka'■ \[Tk 


S N 


z+J 

Z+J 


meaning they evolve independently and we get both a spectrum of curvature modes and isocurvature 
modes at the end of the transition regime 


Vn 


V s 


Hi 


8e*7 r 2 Mp 

H'l 

8e*ir 2 Mp 


(1 + 2 (C — l)e*), 
(1 + 2 (C — l)e*). 



